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DIAGNOSTIC POLYMORPHISMS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims priority from provisional application 60/120,787, 
filed February 19, 1999, which is hereby incorporated by reference in its entirety for 
5 all purposes. 

BACKGROUND 

This invention relates to detection of individuals at risk for pathological 
conditions based on the presence of single nucleotide polymorphisms (SNPs). 
During the course of evolution, spontaneous mutations appear in the 

10 genomes of organisms. It has been estimated that variations in genomic DNA 

sequences are created continuously at a rate of about 100 new single base changes 
per individual (Kondrashow, J. Theor. Biol, 175:583-594, 1995; Crow, Exp. Clin. 
Immunogenet. , 1 2 : 1 2 1 - 1 28, 1 995) These changes, in the progenitor nucleotide 
sequences, may confer an evolutionary advantage, in which case the frequency of 

1 5 the mutation will likely increase, an evolutionary disadvantage in which case the 
frequency of the mutation is likely to decrease, or the mutation will be neutral. In 
certain cases, the mutation may be lethal in which case the mutation is not passed on 
to the next generation and so is quickly eliminated from the population. In many 
cases, an equilibrium is established between the progenitor and mutant sequences so 

20 that both are present in the population. The presence of both forms of the sequence 
results in genetic variation or polymorphism. Over time, a significant number of 
mutations can accumulate within a population such that considerable polymorphism 
can exist between individuals within the population. 

Numerous types of polymorphism are known to exist. Polymorphisms can be 

25 created when DNA sequences are either inserted or deleted from the genome, for 
example, by viral insertion. Another source of sequence variation can be caused by 
the presence of repeated sequences in the genome variously termed short tandem 
repeats (STR), variable number tandem repeats (VNTR), short sequence repeats 
(SSR) or microsatellites. These repeats can be dinucleotide, trinucleotide, 



WO 00/49169 PCT/US00/04251 



2 

tetranucleotide or pentanucleotide repeats. Polymorphism results from variation in 
the number of repeated sequences found at a particular locus. 

By far the most common source of variation in the genome are single 
nucleotide polymorphisms or SNPs. SNPs account for approximately 90% of 
5 human DNA polymorphism (Collins et al., Genome Res., 8:1229-1231, 1998). 
SNPs are single base pair positions in genomic DNA at which different sequence 
alternatives (alleles) exist in a population. In addition, the least frequent allele must 
occur at a frequency of 1% or greater. Several definitions of SNPs exist in the 
literature (Brooks, Gene, 234:177-186, 1999). As used herein, the term "single 

10 nucleotide polymorphism" or "SNP" includes all single base variants and so 
includes nucleotide insertions and deletions in addition to single nucleotide 
substitutions (e.g. A->G). Nucleotide substitutions are of two types. A transition is 
the replacement of one purine by another purine or one pyrimidine by another 
pyrimidine. A transversion is the replacement of a purine for a pyrimidine or vice 

1 5 versa. 

The typical frequency at which SNPs are observed is about 1 per 1000 base 
pairs (Li and Sadler, Genetics, 129:513-523, 1991; Wang et al., Science, 280:1077- 
1082, 1998; Harding et al., Am. J. Human Genet, 60:772-789, 1997; Taillon-Miller 
et al., Genome Res., 8:748-754, 1998). The frequency of SNPs varies with the type 

20 and location of the change. In base substitutions, two-thirds of the substitutions 
involve the C<->T (G<->A) type. This variation in frequency is thought to be 
related to 5-methylcytosine deamination reactions that occur frequently, particularly 
at CpG dinucleotides. In regard to location, SNPs occur at a much higher frequency 
in non-coding regions than they do in coding regions. 

25 SNPs can be associated with disease conditions in humans or animals. The 

association can be direct, as in the case of genetic diseases where the alteration in the 
genetic code caused by the SNP directly results in the disease condition. Examples 
of diseases in which single nucleotide polymorphisms result in disease conditions 
are sickle cell anemia and cystic fibrosis. The association can also be indirect, 

30 where the SNP does not directly cause the disease but alters the physiological 

environment such that there is an increased likelihood that the patient will develop 
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the disease. SNPs can also be associated with disease conditions, but play no direct 
or indirect role in causing the disease. In this case, the SNP is located close to the 
defective gene, usually within 5 centimorgans, such that there is a strong association 
between the presence of the SNP and the disease state. Because of the high 
5 frequency of SNPs within the genome, there is a greater probability that a SNP will 
be linked to a genetic locus of interest than other types of genetic markers. 

Disease associated SNPs can occur in coding and non-coding regions of the 
genome. When located in a coding region, the presence of the SNP can result in the 
production of a protein that is non- functional or has decreased function. More 

10 frequently, SNPs occur in non-coding regions. If the SNP occurs in a regulatory 
region, it may affect expression of the protein. For example, the presence of a SNP 
in a promoter region, may cause decreased expression of a protein. If the protein is 
involved in protecting the body against development of a pathological condition, this 
decreased expression can make the individual more susceptible to the condition. 

1 5 Numerous methods exist for the detection of SNPs within a nucleotide 

sequence. A review of many of these methods can be found in Landegren et al., 
Genome Res. 9 8:769-776, 1998. SNPs can be detected by restriction fragment length 
polymorphism (RFLP)(U.S. Patent Nos. 5,324,631; 5,645,995). RFLP analysis of 
the SNPs, however, is limited to cases where the SNP either creates or destroys a 

20 restriction enzyme cleavage site. SNPs can also be detected by direct sequencing of 
the nucleotide sequence of interest. Numerous assays based on hybridization have 
also been developed to detect SNPs. In addition, mismatch distinction by 
polymerases and ligases have also been used to detect SNPs. 

There is growing recognition that SNPs can provide a powerful tool for the 

25 detection of individuals whose genetic make-up alters their susceptibility to certain 
diseases. There are four primary reasons why SNPs are especially suited for the 
identification of genotypes which predispose an individual to develop a disease 
condition. First, SNPs are by far the most prevalent type of polymorphism present 
in the genome and so are likely to be present in or near any locus of interest. 

30 Second, SNPs located in genes can be expected to directly affect protein structure or 
expression levels and so may serve not only as markers but as candidates for gene 
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therapy treatments to cure or prevent a disease. Third, SNPs show greater genetic 
stability than repeated sequences and so are less likely to undergo changes which 
would complicate diagnosis. Fourth, the increasing efficiency of methods of 
detection of SNPs make them especially suitable for high throughput typing systems 
5 necessary to screen large populations. 

One disease for which the discovery of markers to detect increased genetic 
susceptibility is critically needed is end-stage renal disease. End-stage renal disease 
(ESRD) is defined as the condition when life becomes impossible without 
replacement of renal functions either by kidney dialysis or kidney transplantation. 

10 Hypertension (HTN) and non-insulin dependent diabetes (NIDDM) are the leading 
causes of end-stage renal disease (ESRD) nationally (United States Renal Data 
System, Table IV-3, p. 49, 1994). There is currently an epidemic of ESRD, due 
mainly to the aging of the American population. The ESRD epidemic is of special 
concern among African Americans where the incidence of ESRD is four- to six-fold 

15 higher than for Caucasians (Brancati et al., J. Am. Med. Assoc., 268:3079-3084, 
1992), but where treatment of hypertension, a causative factor in ESRD, is less 
effective (Walker et al., J. Am. Med. Assoc., 268:3085-3091, 1992). 

There are currently 200,000 patients with ESRD receiving renal replacement 
therapy (dialysis or renal transplantation), with an annual cost of $13 billion. These 

20 numbers will certainly increase as the population of the nation continues to age. 

Since 1980, when complete data became available for the first time, most new cases 
of ESRD have been ascribed to NIDDM or hypertension. The incidence of ESRD 
due to NIDDM or hypertension is still increasing, suggesting that the U.S. is in the 
early phase of an epidemic of ESRD. Preventing ESRD would save at least $30,000 

25 per patient, per year in dialysis costs alone, as well as enhance the patient's quality 
of life and ability to work. It is clearly the ideal method of cost-containment for 
renal disease. Without effective prevention of ESRD, the nation will instead be 
forced to adopt less humane methods of cost-containment, such as denial of access 
(gate-keeping), or rely upon unrealistic expectations about patient reimbursement 

30 rates, etc. 
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An ideal approach to prevention of ESRD would be the identification of any 
genes that predispose an individual to ESRD early enough to be able to counteract 
this predisposition. Knowledge of ESRD-predisposing genes is essential for truly 
effective delay, or, ideally, prevention of ESRD. 

5 SUMMARY 

The present inventor has discovered novel single nucleotide polymorphisms 
(SNPs) associated with the development of end-stage renal disease in patients with 
either hypertension or noninsulin-dependent diabetes mellitus. As such, these 
polymorphisms provide a method for diagnosing a genetic predisposition for the 
1 0 development of end-stage renal disease in individuals. Information obtained from 
the detection of SNPs associated with the development of end-stage renal disease is 
of great value in the treatment and prevention of the disease. 

Accordingly, one aspect of the present invention provides a method for 
diagnosing a genetic predisposition for end-stage renal disease in a subject, 

15 comprising obtaining a sample containing at least one polynucleotide from the 

subject, and analyzing the polynucleotide to detect a genetic polymorphism wherein 
said genetic polymorphism is associated with an altered susceptibility to developing 
end-stage renal disease. In one embodiment, the polymorphism is located in a gene 
selected from the group of genes in Table 1 . 

20 Another aspect of the present invention provides an isolated nucleic acid 

sequence comprising at least 10 contiguous nucleotides from SEQ ID NO: 1 or 2, or 
their complements, wherein the sequence contains at least one polymorphic site 
associated with a disease and in particular end-stage renal disease. 

Yet another aspect of the invention is a kit for the detection of a 

25 polymorphism comprising, at a minimum, at least one polynucleotide of at least 10 
contiguous nucleotides of SEQ ID NO: 1 or 2, or their complements, wherein the 
polynucleotide contains at least one polymorphic site associated with end-stage renal 
disease. 

Yet another aspect of the invention provides a method for treating end stage 
30 renal disease comprising, obtaining a sample of biological material containing at 
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least one polynucleotide from the subject; analyzing the polynucleotide to detect the 
presence of at least one polymorphism associated with end stage renal disease; and 
treating the subject in such a way as to counteract the effect of any such 
polymorphism detected. 
5 Still another aspect of the invention provides a method for the prophylactic 

treatment of a subject with a genetic predisposition to end stage renal disease 
comprising, obtaining a sample of biological material containing at least one 
polynucleotide from the subject; analyzing the polynucleotide to detect the presence 
of at least one polymorphism associated with end stage renal disease; and treating 
10 the subject. 

Further scope of the applicability of the present invention will become 
apparent from the detailed description and drawings provided below. It should be 
understood, however, that the following detailed description and examples, while 
indicating preferred embodiments of the invention, are given by way of illustration 
15 only, since various changes and modifications within the spirit and scope of the 

invention will become apparent to those skilled in the art from the following detailed 
description. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 These and other features, aspects, and advantages of the present invention 

will become better understood with regard to the following description, appended 

claims, and accompanying drawings where: 

Figure 1 shows SEQ ID NO: 1 , the nucleotide sequence of the TGF-bl 

promoter region as contained in GenBank (accession no. J04431). 
25 Figure. 2 shows SEQ ED NO: 2, the nucleotide sequence of the TGF-bl 

receptor II promoter region as contained in GenBank (accession no. U37070) 



DEFINITIONS 

nt = nucelotide 
bp = base pair 
30 kb = kilobase; 1000 base pairs 
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ESRD = end-stage renal disease 
HTN = hypertension 

NIDDM = noninsulin-dependent diabetes mellitus 
CRF = chronic renal failure 
5 T-GF = tubulo-glomerular feedback 
CRG = compensatory renal growth 
MODY = maturity-onset diabetes of the young 
RFLP = restriction fragment length polymorphism 
MASDA = multiplexed allele-specific diagnostic assay 
1 0 MADGE = microtiter array diagonal gel electrophoresis 
OLA = oligonucleotide ligation assay 
DOL = dye-labeled oligonucleotide ligation assay 
SNP = single nucleotide polymorphism 
PCR = polymerase chain reaction 
1 5 "Polynucleotide" and "oligonucleotide" are used interchangeably and mean a 

linear polymer of at least 2 nucleotides joined together by phosphodiester bonds and 
may consist of either ribonucleotides or deoxyribonucleotides. 

"Sequence" means the linear order in which monomers occur in a polymer, 
for example, the order of amino acids in a polypeptide or the order of nucleotides in 
20 a polynucleotide. 

"Polymorphism" refers to a set of genetic variants at a particular genetic 
locus among individuals in a population. 

"Promoter" means a regulatory sequence of DNA that is involved in the 
binding of RNA polymerase to initiate transcription of a gene. A "gene" is a 
25 segment of DNA involved in producing a peptide, polypeptide, or protein, including 
the coding region, non-coding regions preceding ("leader") and following ("trailer") 
the coding region, as well as intervening non-coding sequences ("introns") between 
individual coding segments ("exons"). A promoter is herein considered as a part of 
the corresponding gene. Coding refers to the representation of amino acids, start and 
30 stop signals in a three base "triplet" code. Promoters are often upstream ("5 f to") the 
transcription initiation site of the gene. 
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"Gene therapy" means the introduction of a functional gene or genes from 
some source by any suitable method into a living cell to correct for a genetic defect. 

"Wild type allele" means the most frequently encountered allele of a given 
nucleotide sequence of an organism. 
5 "Genetic variant" or "variant" means a specific genetic variant which is 

present at a particular genetic locus in at least one individual in a population and that 
differs from the wild type. 

As used herein the terms "patient" and "subject" are not limited to human 
beings, but are intended to include all vertebrate animals in addition to human 
10 beings. 

As used herein the terms "genetic predisposition", "genetic susceptibility" 
and "susceptibility" all refer to the likelihood that an individual subject will develop 
a particular disease, condition or disorder. For example, a subject with an increased 
susceptibility or predisposition will be more likely that average to develop a disease, 

1 5 while a subject with a decreased predisposition will be less likely than average to 
develop the disease. A genetic variant is associated with an altered susceptibility or 
predisposition if the allele frequency of the genetic variant in a population or 
subpopulation with a disease, condition or disorder varies from its allele frequency 
in the population without the disease, condition or disorder (control population) or a 

20 control sequence (wild type) by at least 1%, preferably by at least 2%, more 
preferably by at least 4% and more preferably still by at least 8%. 

As used herein "isolated nucleic acid" means a species of the invention that 
is the predominate species present (i.e., on a molar basis it is more abundant than 
any other individual species in the composition). Preferably, an isolated nucleic acid 

25 comprises at least about 50, 80 or 90 percent (on a molar basis) of all 

macromolecular species present. Most preferably, the object species is purified to 
essential homogeneity (contaminant species cannot be detected in the composition 
by conventional detection methods). 

As used herein, "allele frequency" means the frequency that a given allele 

30 appears in a population. 
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DETAILED DESCRIPTION 

All publications, patents, patent applications and other references 
cited in this application are herein incorporated by reference in their entirety as if 
each individual publication, patent, patent application or other reference were 
5 specifically and individually indicated to be incorporated by reference. 

Compensatory renal growth 

Several factors are involved in compensatory renal growth. In CRF the renal 
cortex gets progressively thinner. Loss of renal parenchyma thus occurs selectively 
from the compartment in which glomeruli are located. This is the same 

1 0 compartment which enlarges in compensatory renal growth (Fine, Kidney Int. , 

29:619-634, 1986. and refs. therein). As in myocardial hypertrophy, which may lead 
to dilated cardiomyopathy, renal hypertrophy can be the forerunner of tissue loss. 

Hepatocyte nuclear factor- la (HNF-la) plays a role in compensatory renal 
growth. This transcription factor, which is induced by HNF-4a, is present in liver 

15 and other tissues, including the kidney, where it behaves as an immediate early gene 
in compensatory renal growth (Moskowitz and Liu J Urology, 154:1560-1565, 
1995). HNF-la may be an anti-apoptotic factor in the liver as well as in other 
tissues (pancreas, kidney, peripheral neurons, retina), akin to NF-kB, a ubiquitous 
anti-apoptotic transcription factor (Beg and Baltimore Science, 274:782-784, 1996), 

20 since "knock-out" mice for either gene fail to develop as embryos (Duncan et al. 
Science, 281 :692-695, 1998). The NF-kB2 (p65, or RelA) -/- "knockout" mouse 
dies on embryonic day 9 after massive hepatocyte apoptosis (Beg et al., Nature, 
376:167-170, 1995). 

The regulation of intrarenal blood flow is also a factor in compensatory renal 

25 growth. This regulation is quite complex, perhaps more so than for any other organ 
given its extreme blood flow/mass ratio and the exquisite matching of chemical 
reabsoiptive work with blood flow required of the kidney (Navar et al., Physiol 
Rev., 76:425-536, 1996). In rodent compensatory renal growth, the renal cortex 
(where biochemical evidence of growth is apparent within 5 min after contralateral 

30 nephrectomy [Toback et al., J. Clin, Invest., 54:91-97, 1974]) undergoes 

vasoconstriction within 3 min after contralateral nephrectomy (Moskowitz et al., 
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Renal Failure, 18:833-846, 1996). This result was unexpected, since blood flow had 
earlier been found to increase substantially in the renal artery within minutes after 
contralateral nephrectomy (Krohn et al. s J. Urol, 103:564-568, 1970). For blood 
flow to increase in the main renal artery, yet for the renal cortex to undergo 
5 vasoconstriction, suggests that renal arterial blood flow was re-directed from cortex 
to medulla after uninephrectomy. 

This simultaneous vasoconstriction of renal cortex and vasodilatation of 
renal medulla implicates primarily adenosine, since there is a selective distribution 
of vasoconstrictive adenosine Al receptors in renal cortical vessels, and vasodilatory 

10 adenosine A2 receptors in renal medullary vessels (Anderson RJ. Adenosine: 
mechanisms of renal actions, In: Hormones, Autacoids, and the Kidney, eds. 
Goldfarb S, ZiyadehFN. Churchill Livingstone, 1991:281-296). 

Adenosine is released from cells in proportion to the rate of work demanded 
of them and the degree of ischemia to which they are exposed. Adenosine is derived 

1 5 from the breakdown of ATP, which if not replenished from ADP or AMP, is further 
dephosphorylated to adenosine, The medullary thick ascending limb carries out the 
most active (ATP-requiring) ion transport in the nephron, yet has the lowest oxygen 
tension. The medullary thick ascending limb is therefore likely to be the major site 
for adenosine generation along the nephron. 

20 Adenosine has also long been implicated as a possible mediator of tubulo- 

glomerular feedback (Anderson RJ. Adenosine: mechanisms of renal actions, In: 
Hormones, Autacoids, and the Kidney, eds. Goldfarb S, Ziyadeh FN. Churchill 
Livingstone, 1991:281-296). Tubulo-glomerular feedback is characterized by the 
reduction of blood flow to individual nephrons by afferent arteriolar constriction 

25 when blood flow exceeds the capacity of the nephron mass to perform the chemical 
work of solute reabsorption. The renal cortical vasoconstriction that occurs within 3 
minutes after contralateral nephrectomy could be most easily explained by activation 
of tubulo-glomerular feedback. 
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Model for ESRD 

The progression of CRF appears to involve ongoing tubulo-glomerular 
feedback. Bricker's "intact nephron hypothesis" (Bricker et al. Am. J. Med. 28:77- 
97, 1960) focused on the positive feedback loop which exists for remnant nephrons, 
5 unlike nephrons in healthy kidneys. Brenner expanded this concept by calling 
attention to the danger of elevated glomerular capillary pressure (P GC ) and, more 
generally, hyperfiltration by single nephrons. More recently, Brenner has shown 
that there is a threshold number of nephrons (about one million) below which 
progressive renal failure occurs (Brenner et al., Kidney Intl., Suppl., 63:S124-S127, 

10 1997). These observations are all compatible with the normal functioning of tubulo- 
glomerular feedback: called upon briefly , as after contralateral nephrectomy, 
compensatory renal growth (CRG) results; activated continuously , which occurs 
when the functioning renal mass is less than one million nephrons, tubulo- 
glomerular feedback leads to apoptosis. This, in turn, leads to CRF and ESRD. The 

1 5 body is thus apparently designed to obliterate the kidneys altogether (through 

apoptosis) rather than to allow glomerular filtration to exceed tubular capacity for 
reabsorption, which would result in rapid depletion of total body water. 

This model can be visualized as follows: 

Initial loss of nephrons — > T-GF — > >10 6 nephrons remain? — > 

20 — >YES--~> CRG (STOP) 
T 

T 

T 

NO — >continuing loss of nephrons — >apoptosis — > CRF — > ESRD 

25 Thus, factors such as adenosine and HNF-la, which affect tubulo-glomerular 
feedback and (indirectly) CRG, also influence the progression to ESRD. 
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Genetic Predisposition to ESRD 

No more than 30% of patients with NIDDM, and 10% of patients with 
essential hypertension, develop nephropathy, suggesting a genetic component. A 
genetic influence in susceptibility to renal disease is also suggested by the findings 

5 of Yamagata et al., Nature, 384:458-460, 1996; Krohn et al., J. Urol 103:564-568, 
1970; and Anderson RJ. Adenosine: mechanisms of renal actions . In: Hormones, 
Autacoids, and the Kidney, eds. Goldfarb S, Ziyadeh FN. Churchill Livingstone, 
1991 :281-2962L Alterations in the coding sequence of a single major gene are 
responsible for both NIDDM and its complications (including nephropathy) in 

10 "maturity-onset diabetes of the young" (MODY), which is inherited as an autosomal 
dominant disease. Nonsense mutations in the coding sequence of either of two 
transcription factors, HNF-la (MODY 3) or HNF-4a (MODY 1), are sufficient to 
cause disease in heterozygotes. 

MODY resembles other Mendelian (single-gene) disorders, such as cystic 

15 fibrosis and Huntington's disease, in its early onset (usually before the 4th decade) 
and its clear pattern of inheritance. Mendelian disorders, which are relatively rare in 
populations, have so far all involved alterations in the coding sequence of non- 
redundant genes. 

By contrast, the median age for the incidence of ESRD due to hypertension 
20 is 68, and due to NIDDM, 61 (United States Renal Data System. Table IV-3, p. 49, 
1994). Although there is clearly an increased familial risk in ESRD (Bergman et al., 
Am. J. Kid. Dis., 27:341-346, 1996), there is no evidence for the segregation of a 
single major gene, as in MODY. 

In the simplest case (MODY 1 and 3), NIDDM and its complications appear 
25 to result from accelerated apoptosis in the pancreas and affected target organs, 

including the kidney, brought on by having only -50% of the normal activity of an 
anti-apoptotic factor (HNF-la or HNF-4a) ( Yamagata K et al., Nature, 384:455- 
458, 1996; Yamagata K et al., Nature, 384:458-460, 1996). 

Based on the above observations, it is clear that hypertension and NIDDM 
30 are usually not sufficient to cause renal failure, since ESRD is a relatively rare 
complication of each. Also, progressive renal failure due to hypertension or 
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NIDDM is characterized by loss of parenchyma without inflammation (e.g. localized 
pain, fever, sweats). These symptoms define apoptosis (Wyllie AH. Cell death: a 
new classification separating apoptosis from necrosis . In: Cell death in biology and 
pathology, Wyllie AH. Ed, Chapman & Hall, 1981 :9-34), further supporting a role 
5 for renal apoptosis in ESRD. 

Thus, vasoconstriction occurs early in renal growth (Moskowitz et ah, Renal 
Failure 18:833-846, 1996), and CRG involves the same "immediate early" genes 
whose excess activity triggers apoptosis in other systems, e.g. p53, RB, c-myc, and 
cyclinDl (Moskowitz and Liu J Urology 154:1560-1565, 1995 andrefs. therein). 
10 The following points are consistent with the above data, support the model 

for ESRD disclosed above, and provide a rationale for asserting a genetic component 
for susceptibility to ESRD: 

Renal apoptosis, and clinical ESRD, result from overactivity of gene 

products along the growth pathway. 
1 5 - Common, polygenic diseases, such as ESRD, involve subtle variations in the 

level of normal, interacting gene products, rather than dramatic alterations in 

the coding sequence, as occurs in single-gene disorders. 

Renal apoptosis is triggered by the same factors which trigger compensatory 

renal growth (CRG). 

20 The use of candidate genes in case-control association studies is disclosed 

herein to identify genetic variants at polymorphic genetic loci which serve as 
indicators of predisposition to ESRD. The case-control association study is a 
preferred method to find disease susceptibility genes, due to its low cost in 
comparison to pedigree-based positional cloning methods. Association studies are 

25 suited for common, polygenic diseases, in which multiple interacting 

polymorphisms exist, none of which is either sufficient or necessary to cause disease 
(Risch and Merikangas, Science, 273 : 1 5 1 6-1 5 1 7, 1 996). 

The level of expression of a gene product is determined to a significant 
extent by its rate of transcription, which in large part depends on regulatory 

30 sequences within the promoter region. The promoter is usually located immediately 
upstream (5') of the transcription initiation site of the gene. Small changes in the 
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promoter can affect the level of expression of a gene (Spek et al. J. Biol. Chenu, 
273:10168-10173, 1998). In addition, the ACE I/D polymorphism illustrates that 
the presence of onAlu sequence within an intron may retard transcription (see 
Moskowitz, Hypertension, thermo tolerance, and the "African gene": an hypothesis. 
5 Clinical Experimental Hypertension: Part A. Theory and Practice 18:1-19, 1996 for 
refs.). Alu insertions are extremely common, comprising 5% of the entire human 
genome, and occurring on average every 10 kb. 

Thus, a single nucleotide change in a regulatory element (often only 10 bp 
long) in the promoter, or the presence/absence of an A lu sequence within an intron 

1 0 (as in the ACE I/D polymorphism) or within the promoter region, could alter the 
level of expression of a protein important for growth and/or apoptosis. Such 
changes are expected to be more common than coding sequence variations, and 
provide a better substrate for the action of natural selection, because the full activity 
of the gene product is retained. Only the gene's regulation has changed, which may 

15 confer a sufficient selective advantage to result in a relatively high frequency of the 
allele in a population. 

Genes Affecting ESRD-Related Pathways 

Various genes are involved, directly or indirectly, in the development of 
ESRD. The mechanism for their effect on ESRD may be through their involvement 

20 in the development of hypertension, CRG, NIDDM, CRF, MODY, tubulo- 

glomerular feedback, or other conditions. A genetic variant of any of these genes 
may influence the expression of the genes to increase or decrease susceptibility to 
ESRD. While not being bound to any particular theory, the following explanations 
of the potential effects of various genes on the above conditions are believed to 

25 establish a basis for asserting their possible involvement in conferring susceptibility 
to ESRD. 

Additional renal vasoconstrictors involved in triggering compensatory renal 
growth are angiotensin II and norepinephrine. Both have been shown to induce a 
similar subset of the immediate early genes which are also expressed in 
30 compensatory renal growth (Moskowitz and Liu, J. Urology, 154:1560-1565, 1995 
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and refs. therein). These may lead to ESRD rather than merely to hypertension and 
insulin resistance due to activation of molecules farther downstream in the pathway 
of renal growth and apoptosis, as described below. 

Angiotensin II specifically induces TGFb-1 (Lee et ah, J. Mol Cell 
5 Cardiol.* 21:2341-2351, 1995), which induces apoptosis through a nitric oxide- 
independent mechanism, perhaps involving NF-kB (Martin-Sanz et ah, Hepatology 
23:1200-1207, 1996). Activation of protein kinase C (PKC) by angiotensin II, and 
protein kinase A (PKA) by norepinephrine, stimulate NF-kB transcription (see 
below). Activation of both PKC and PKA occurs early during compensatory renal 

10 growth (Moskowitz and Liu, J. Urology, 154:1560-1565, 1995). 

In addition to induction of TGFb-1, angiotensin II appears to be especially 
important in causing renal apoptosis through activation of endothelin-1 (ET-1). ET- 
1 is a more potent vasoconstrictor than angiotensin II, and has already been 
implicated in nephrosclerosis (Rabelink et ah, Kidney IntL 9 50:1827-1833, 1996). 

1 5 Individuals who overexpress ET-1 in response to angiotensin II may develop 

unusually intense and long-lasting local vasoconstriction, hypoxia, and depletion of 
cellular ATP, resulting in apoptosis (Richter et ah, FEBS Letters, 378:107-1 10, 
1996). 

The endothelin A receptor (ETA), which mediates vasoconstriction, could 
20 affect ESRD because of its possible contribution to ET-1 signaling overactivity 

(Benigni et ah, Mineral Electrolyte Metab., 21 : 283-291 , 1 995). Genetic variations 
in the promoter region, introns (for presence/absence of Alu sequence), and coding 
sequence of ETA can thus confer increased basal or stimulated activity to the 
receptor. 

25 The transforming growth factor-b system is also a likely mediator of renal 

apoptosis. TGF-b is intimately connected with glomerular sclerosis, mesangial 
matrix expansion, and tubulointerstitial fibrosis in experimental rodent models and 
human glomerulnephritis (Border et ah, Kidney Intl., 47(Suppl 49):S-59-S-61, 
1995). Of the three isoforms available, TGF-b 1 has been implicated most 

30 consistently in pathologic fibrosis (Khalil et ah, Am. J. Respir. Cell. Mol Biol, 
14:131-138, 1996). Activation of protein kinase C early during CRG would have 
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the effect of stimulating TGF-bl production, since the TGF-bl promoter contains 
AP-1 sites (Kim et al. f J. Biol Chem., 264:402-408, 1989). Angiotensin II has been 
shown to induce TGF-bl expression in renal mesangial cells, endothelial cells, and 
proximal tubular epithelial cells. Thus, greater induction of TGF-bl , or greater 
5 expression of its two main receptors (TGFb-RI and TGFb-RII), may occur in 
patients who progress to ESRD compared to patients who never develop CRF. 

Since the coding sequence of TGF-bl is identical between mouse and 
human, a period of evolutionary divergence of greater than 100 hundred million 
years, no human polymorphisms in the coding sequence are expected. Thus the 
10 TGF-bl promoter and introns would be more likely candidates for genetic variants 
than the exons of the TGF-bl structural gene. The promoter sequences and the 
structural genes for TGFb-RI and TGFb-RII are also likely candidates for genetic 
variations. 

The L-type Ca 2+ channel may be involved in hypertension, especially among 

15 African Americans, since ACE inhibitors are not as effective in blood pressure 

control for African .Americans as Ca 2+ channel blockers directed against the L-type 
calcium channel. The a subunit, a large gene product consisting of 46 exons, 
requires the much smaller b subunit for activity. The b subunit (encoded by the 
CACNLB3 gene) has a regulatory role, and undergoes serine/threonine 

20 phosphorylation to activate the calcium channel. By analogy with other transport 
proteins, the b subunit likely acts as a gate to occlude the ion channel located in the a 
subunit (Lacinova, et al., FEES Letters, 373:103-107, 1995). 

Phosphorylation of the b subunit probably opens the gate by creating 
electrostatic charge repulsion between a negatively charged phosphate (on a serine 

25 or threonine hydroxy 1 group in the b subunit) and a negatively charged amino acid 
(e.g. aspartate or glutamate) in the a subunit which is normally hydrogen-bonded to 
the serine/threonine -OH group. A change in the coding sequence of the 
phosphorylated serine or threonine residue to a negatively charged amino acid (e.g. 
asp or glu, which requires a change of at least 2 nucleotides in the CACNLB3 gene) 

30 would create a constitutively open Ca 2+ channel. Other amino acid changes in the 
coding sequence that destabilize the hydrogen bonding of the ser/thr hydroxyl group 
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might result in a less dramatic, although still supranormal, activation of the L-type 
Ca 2+ channel in response to agonists. Thus, genetic variants in the CACNLB3 gene 
could lead to hypertension and could therefore influence susceptibility to ESRD. 

The consequences of excessive Ca 2+ entry are expected to include increased 
5 systemic vascular resistance from vascular smooth muscle cell contraction, insulin 
resistance, as well as enhanced apoptosis, since cell death is associated with 
accumulation of intracellular, especially intramitochondrial, calcium (Schwartzman 
and Cidlowski Endocrine Rev., 14:133-151, 1993). Indeed, a major inhibitor of 
apoptosis, the bcl-2 gene product, binds calcium at the nuclear and mitochondrial 

10 membranes, and is active in the kidney. 

Impaired bcl-2 function has been shown to be important in mouse and human 
models of renal cystic disease. Bcl-2 -/- "knockout" mice unexpectedly 
demonstrated polycystic kidney disease, associated with accelerated apoptosis 
during nephrogenesis (Sorenson et al., Am. J. Physiol 271 :F184-F193, 1996). 

15 Apoptosis is increased in human autosomal dominant polycystic kidney disease, as 
well as a murine model of polycystic kidney disease (Trudel et al., J. Am. Soc. 
Nephrol 7:1624, 1996, abst.}; c-myc is overexpressed in polycystic kidney disease 
in both species. Variations in the 5 f promoter region and introns are thus consistent 
with reduced expression of bcl-2, and therefore enhanced apoptosis, in patients with 

20 ESRD. 

The genes bcl-1 (encoding cyclin Dl), c-myc, p53, Rb, WT-1, and DR1 have 
been found to behave as "immediate early" genes in CRG (Moskowitz and Liu, J. 
Urology, 154:1560-1565, 1995). Additionally, vHL, the von Hippel-Lindau tumor 
suppressor, is analogous to WT-1, the Wilm's tumor suppressor. Where known, the 

25 promoters of these tumor suppressors contain AP-1 sites, which are sensitive to 

protein kinase C activation (Moskowitz and Liu, J. Urology, 154:1560-1565, 1995). 
Cyclin Dl and c-myc are positive regulators of cellular proliferation, whereas p53, 
Rb, WT-1, DR1, and vHL are negative growth control elements. Where tested, 
these gene products have been shown to induce apoptosis when overexpressed 

30 (Steller, Science, 267:1445-1449, 1995). Genetic variants of promoters or introns 
for each of these genes may lead to overexpression of the gene product. As with 
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previously discussed genes, these genetic variants could take the form of single 
nucleotide additions, deletions, or substitutions, or the insertion or deletion of Alu 
elements. For example, p53 is known to contain an Alu sequence in intron 6 
(GenBank Accession no. U38671), which might be deleted in ESRD patients, 
5 leading to its overexpression. 

Tumor necrosis factor-a (TNFa) is expressed by phagocytes, including renal 
mesangial cells. When bound to its receptor (TNF receptor), apoptotic death of the 
TNFa-bearing cell results. Fas ligand (FasL) is similar to TNFa, although when 
FasL binds to its receptor (Fas), apoptotic death of the Fas-bearing cell results. Both 

1 0 Fas and FasL are expressed in human kidney (Gonzalez-Cuadrado et al., Kidney 
Intl., 49:1064-1070, 1996), and TNFa has been shown to be involved in human 
glomerulonephritis. Here, we hypothesize that these death signals mediate apoptosis 
in progressive CRF due to hypertension or NIDDM. Support for an immunological 
role in CRF comes from the finding of specific HLA associations with hypertensive 

1 5 ESRD (Freedman et al., Am. JHypertens., 4:393-398, 1 991). The promoter, exons, 
and intron/ exon boundaries are known for TNFa and Fas, although not for FasL (see 
Table 1). Furthermore, a genetic variant has already been described in TNFa's intron 
1 (Wilson et al., Human Molec. Genetics 1:353, 1992; GenBank accession no. 
LI 1698). 

20 Bax, a major antagonist of bcl-2, promotes apoptosis in lymphocytes 

(Miyashita and Reed, Cell, 80:293-299, 1995). Promoter and intronic variation (e.g. 
single nucleotide changes or deletions of an Alu sequence) might increase the 
transcription/translation of this gene product. Bak and Bak-2 similarly interact with 
bcl-2 to promote apoptosis. Although none of these three gene products has yet 

25 been shown to function in the kidney, loss of bcl-2 activity leads to polycystic 
kidney disease (see above). 

Nitric oxide (NO) has been strongly implicated in apoptosis of endothelial 
(Bonfoco et al., Proc. Natl. Acad. Sci. USA, 92:7162-7166, 1995) and vascular 
smooth muscle cells (Nishio et al., Biochem. Biophys. Res. Commun., 221:163-168, 

30 1996). Nitric oxide, which is vasodilatory, antagonizes the vasoconstrictive effects 
of angiotensin II and endothelins. Since angiotensin II promotes renal injury, nitric 
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oxide may protect against renal injury from systemic disease such as hypertension 
and non-insulin dependent diabetes mellitus (NIDDM; Bataineh and Raij, Kidney 
Int., Suppl., 68:S140S19 5 1 998) Nitric oxide has also been implicated in the 
progression of renal disease in rats (Brooks and Contino, Pharmacology, 56:257- 
5 261, 1998) and humans (Noris and Remuzzi, Contrib. Nephrol 1 19:8-15, 1996; 
Kone, Am. J. Kidney Dis., 30:31 1-333, 1997; Aiello et ah, Kidney Int., Suppl., 
65:S63-S67, 1998; Raij, Hypertension, 31:189-193, 1998). The nitric oxide 
synthase genes are recognized candidate genes for hypertension, renal failure, and 
cardiovascular in general (Soubrier, Hypertension, 31:189-193, 1998) 
10 NO can directly oxidize (and activate) thiol-containing proteins such as NF- 

kB and AP-1 (Stamler, Cell, 78:931-936, 1994). NO can either promote apoptosis 
or prevent it. Above a threshold concentration, NO seems to stimulate apoptosis 
(Bonfoco et al., Proc. Natl Acad. Set USA, 92:7162-7166, 1995; Stamler, Cell, 
78:931-936, 1994). 

15 The highest amount of NO is made by the inducible NO synthase (iNOS, 

NOS II), which is fully active at the prevailing intracellular calcium concentration 
(Ca { -100 nM), and, once induced, remains active for days, producing nanomolar 
amounts of NO (Yu et al., Proc. Natl Acad. Sci. USA, 91:1691-1695, 1994). The 
cis regulatory sequences for iNOS are not fully known. However, a region of 1798 

20 nucleotides (nt) immediately upstream (5') of the gene has been sequenced. 

Additional regulatory regions far upstream have been found in the human iNOS 
gene (de Vera ME et al., Proc. Natl. Acad. Sci. USA, 93 : 1 054- 1 059, 1 996), but have 
not yet been reported. Increased inducibility of iNOS would have conferred an 
important selection advantage, since iNOS is thought to be the major mechanism for 

25 immune cell-mediated killing of infectious agents such as parasites (e.g. malaria), 
bacteria, and viruses. 

An additional source of renal NO is endothelial constitutive NOS (ecNOS, 
NOS III). ecNOS requires an elevation of Ca, to be active, since it must bind 
calmodulin for activity. ecNOS, which produces picomolar amounts of NO, may 

30 thus seem an unlikely source of large amounts of NO, but it is specifically activated 
by shear stress (Awolesi et al., Surgery, 1 16:439-445, 1994), and may be involved in 
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arterial remodeling. Like endothelin- 1 , ecNOS may therefore account for the 
clinical observation that the rate of progression of CRF is proportional to the degree 
of hypertension. Single nucleotide variations in the 5 f promoter region (1600 nt) of 
ecNOS might thus allow for increased induction. 
5 Finally, NF-kB and its repressor, IkB, play an important role in apoptosis of 

immune cells, including monocyte/macrophage/mesangial cells, and tumor cells. 
NF-kB is a ubiquitous transcription factor which is present in renal tissue (Amoah- 
Apraku et al., Kidney Intl., 48:674-682, 1995). NF-kB binding sites exist in the 
promoterregit^ ^fj* ^anv genes, including pro-apoptotic molecules such as TNFa, 

iO Fas, iNOS, TGFb-1, aw anti-apoptotic gene products such as bcl-2. In this regard it 
is interesting that the production of TNF-b parallels the progression of CRF 
(Descamps-Latshca et al., J Immunol, 154:882-892, 1995), and is perhaps 
responsible for the cachexia of ESRD. NF-kB prevents apoptosis in tumor cell lines, 
but may induce apoptosis in immune cells. Its net effect may depend on the relative 

15 amounts of pro- and anti-apoptotic molecules which it induces in each tissue. 

NF-kB is a heterodimer, composed of p50 (NF-kB 1) which is not required 
for activity (Baeuerle, Biochim. Biophys. Acta. 1072:63-80, 1991), and p65 (also 
called RelA, or NF-kB2), which binds to kB sites. p65 (NF-kB2) is inducible by 
AP-1 . It is therefore not surprising that NF-kB2 has been shown to be induced by 

20 angiotensin II, an activator of PKC. 

Activation of NF-kB also occurs post-translationally by phosphorylation of 
its repressor, IkB, on serine residues by a serine/threonine protein kinase such as 
PKC, PKA, or Ca 2 7calmodulin-dependent protein kinase. A nuclear localization 
signal is then exposed on NF-kB, allowing it to translocate to the nucleus, where it 

25 binds to kB sites in DNA. Altering the molar ratio between NF-kB and its 

cytoplasmic inhibitor, IkB, could thus lead to a state of "constitutive" activation (too 
little IkB) or repression (excess IkB) of NF-kB. Interestingly, IkB contains kB 
binding sites within its promoter sequence, establishing an efficient negative 
feedback loop. Loss of kB binding sites within the IkB promoter would thus lead to 

30 runaway NF-kB activity. 
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A summary of the human genomic sequences which may be examined is 
presented below. This list is not meant to be exclusive; determination of genetic 
variants of any other sequences which may have similar influences on ESRD, 
hypertension, MODY, NIDDM, ORG, CRF, or apoptosis is contemplated as within 
5 the scope of this invention. 



Table 1. Summary of Genes Affecting ESRD 



20 



Vasoconstrictors 


Promoter (nri 


No, of Exons 


Accession Nos. 


Endothelin-1 


3576 


c 
3 


J05008 


Endothelin receptor-A 


476 


Qt A 1 1 A 

o(4i I4nt) w 


D11144 


Probable triggers for renal apoptosis 






TGFbl 


1140 


•7 


X05839, Y001 12 


TGFB-RI 


1397 


(not known) 


U51139 


TGFB-RII 


1883 


7(1765nt)* 


U37070 


CyclinDl(bcl-l) 


3015 


(not known) 


Z290/S 


DR1 


323 


(not known) 


T29750 


c-myc 


2462 


3 


D 10493, D90467 


P 53 


532 


11 


J04238, 








M22881-98 


RB 


900 


27 


XI 643 9, 










WT-1 


1430 


(not known) 


X74840 


vHL 


715 


(not known) 


U19763 


CACNLB3 


(not known) 


13 (1793nt)* 


D43701-4 


bcl-2 


1394 


(not known) 


X51898 


Implicated in renal apoptosis 








TNFa 


1178 


4 


X021910, 


LI 1698 






Fas 


2344 


9 


X87625, 








X82279-86 


IkBa 


506 


(not known) 


S67380 


NF-kB2 (p65, RelA) 


1396 


10 


L01459, 








222949-54 


Involved in lymphocyte apoptosis; possibly involved in renal apoptosis 




Bak 


1949 


3 


U23765, 








D88396-7 


Bax 


972 


(not known) 


U17193 


Bak-2 


3279 


1 


U16812 


iNOS 


1798 


11 


U 15666, 








U65689-99 


ecNOS 


600 


(not known) 


AF032908 



(see also 
Marsden et al. 
(1993) J. Biol. 
Chem. 



268:17478-1748 

Novel Polymorphisms 

The present application provides ten single nucleotide polymorphisms 
(SNPs) in genes associated with end stage renal diesase. Two substitution SNPs are 
located in the TGF-bl promoter. An addition six SNPs, three substitutions, two 
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deletions, and one addition, have been found in the TGF-B-RI1 promoter. The 
location of these SNPs as well as the wild type and variant nucleotides are 
summarized in Table 2. 



Preparation of Samples 

5 The presence of genetic variants in the above genes or their control regions, 

or in any other genes that may affect susceptibility to ESRD is determined by 
screening nucleic acid sequences from a population of individuals for such variants. 
The population is preferably comprised of some individuals with ESRD, so that any 
genetic variants that are found can be correlated with ESRD. The population is also 

10 preferably comprised of some individuals that have known risk for ESRD, such as 
individuals with hypertension, NIDDM, or CRF. The population should preferably 
be large enough to have a reasonable chance of finding individuals with the sought- 
after genetic variant. As the size of the population increases, the ability to find 
significant correlations between a particular genetic variant and susceptibility to 

15 ESRD also increases. Preferably, the population should have 10 or more 
individuals. 

The nucleic acid sequence can be DNA or RNA. For the assay of genomic 
DNA, virtually any biological sample containing genomic DNA (e.g. not pure red 
blood cells) can be used. For example, and without limitation, genomic DNA can be 

20 conveniently obtained from whole blood, semen, saliva, tears, urine, fecal material, 
sweat, buccal cells, skin or hair. For assays using cDNA or mRNA, the target 
nucleic acid must be obtained from cells or tissues that express the target sequence. 
One preferred source and quantity of DNA is 10 to 30 ml of anticoagulated whole 
blood, since enough DNA can be extracted from leukocytes in such a sample to 

25 perform many repetitions of the analysis contemplated herein. 

Many of the methods described herein require the amplification of DNA 
from target samples. This can be accomplished by any method known in the art but 
preferably is by the polymerase chain reaction (PCR). Optimization of conditions 
for conducting PCR must be determined for each reaction and can be accomplished 

30 without undue experimentation by one of ordinary skill in the art. In general, 

methods for conducting PCR can be found in U.S. Patent Nos 4 ? 965 r 188, 4,800,159, 
4,683,202, and 4,683,195; Ausbel et al., eds., Short Protocols in Molecular Biology, 
3 rd ed., Wiley, 1995; and Innis et al., eds., PCR Protocols, Academic Press, 1990. 
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Other amplification methods include the ligase chain reaction (LCR) (see, 
Wu and Wallace, Genomics, 4:560-569, 1989; Landegren et al.. Science, 241 : 1077- 
1080, 1988), transcription amplification (Kwoh et ah, Proc. Natl. Acad. Sci. USA, 
86:1 173-1 177, 1989), self-sustained sequence replication (Guatelli et Bl. 9 -Proc. Natl. 
5 Acad Sci. USA, 87:1874-1878, 1990), and nucleic acid based sequence 

amplification (NASBA). The latter two amplification methods involve isothermal 
reactions based on isothermal transcription, which produces both single stranded 
RNA (ssRNA) and double stranded DNA (dsDNA) as the amplification products in 
a ratio of about 30 or 100 to 1, respectively. 

10 Detection of Polymorphisms 

Detection of Unknown Polymorphisms 

Two types of detection are contemplated within the present invention. The 
first type involves detection of unknown SNPs by comparing nucleotide target 
sequences from individuals in order to detect sites of polymorphism. If the most 
1 5 common sequence of the target nucleotide sequence is not known, it can be 

determined by analyzing individual humans, animals or plants with the greatest 
diversity possible. Additionally the frequency of sequences found in subpopulations 
characterized by such factors as geography or gender can be determined. 

The presence of genetic variants and in particular SNPs is determined by 

20 screening the DNA and/or RNA of a population of individuals for such variants. If 
it is desired to detect variants associated with a particular disease or pathology, the 
population is preferably comprised of some individuals with the disease or 
pathology, so that any genetic variants that are found can be correlated with the 
disease of interest. It is also preferable that the population be composed of 

25 individuals with known risk factors for the disease. The populations should 

preferably be large enough to have a reasonable chance to find correlations between 
a particular genetic variant and susceptibility to the disease of interest. In one 
embodiment, the population should have at least 10 individuals, in another 
embodiment, the population should have 40 individuals or more. In one 

30 embodiment, the population is preferably comprised of individuals who have known 
risk factors for ESRD such as individuals with hypertension, NIDDM. or CRF. In 
addition, the allele frequency of the genetic variant in a population or subpopulation 
with the disease or pathology should vary from its allele frequency in the population 
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without the disease or pathology (control population) or the control sequence (wild 
type) by at least 1%, preferably by at least 2%, more preferably by at least 4% and 
more preferably still by at least 8%. 

Determination of unknown genetic variants, and in particular SNPs, within a 
5 particular nucleotide sequence among a population may be determined by any 
method known in the art, for example and without limitation, direct sequencing, 
restriction length fragment polymorphism (RFLP), single-strand conformational 
analysis (SSCA), denaturing gradient gel electrophoresis (DGGE), heteroduplex 
analysis (HET), chemical cleavage analysis (CCM) and ribonuclease cleavage. 

10 Methods for direct sequencing of nucleotide sequences are well known to 

those skilled in the art and can be found for example in Ausubel et al., eds., Short 
Protocols in Molecular Biology, 3 rd ed., Wiley, 1995 and Sambrook et al, Molecular 
Cloning, 2 nd ed., Chap. 13, Cold Spring Harbor Laboratory Press, 1989. Sequencing 
can be carried out by any suitable method, for example, dideoxy sequencing (Sanger 

15 et al., Proc. Natl. Acad. Sci. USA, 74:5463-5467, 1977), chemical sequencing 

(Maxam and Gilbert, Proc. Natl Acad. Sci. USA, 74:560-564, 1977) or variations 
thereof. Direct sequencing has the advantage of determining variation in any base 
pair of a particular sequence. 

RFLP analysis (see, e.g. U.S. Patents No. 5,324,631 and 5,645,995) is useful 

20 for detecting the presence of genetic variants at a locus in a population when the 

variants differ in the size of a probed restriction fragment within the locus, such that 
the difference between the variants can be visualized by electrophoresis. Such 
differences will occur when a variant creates or eliminates a restriction site within 
the probed fragment. RFLP analysis is also useful for detecting a large insertion or 

25 deletion within the probed fragment. Thus, RFLP analysis is useful for detecting, 
e.g., an Alu sequence insertion or deletion in a probed DNA segment. 

Single-strand conformational polymorphisms (SSCPs) can be detected in 
<220 bp PCR amplicons with high sensitivity (Orita et al, Proc. Natl. Acad. Sci. 
USA, 86:2766-2770, 1989; Warren et al., In: Current Protocols in Human Genetics, 

30 Dracopoli et al., eds, Wiley, 1994, 7.4.1-7.4.6.). Double strands are first heat- 
denatured. The single strands are then subjected to polyacrylamide gel 
electrophoresis under non-denaturing conditions at constant temperature (i.e. low 
voltage and long run times) at two different temperatures, typically 4-10°C and 23°C 
(room temperature). At low temperatures (4-10°C), the secondary structure of short 
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single strands (degree of intrachain hairpin formation) is sensitive to even single 
nucleotide changes, and can be detected as a large change in electrophoretic 
mobility. The method is empirical, but highly reproducible, suggesting the existence 
of a very limited number of folding pathways for short DNA strands at the critical 
5 temperature. Polymorphisms appear as new banding patterns when the gel is stained. 
Denaturing gradient gel electrophoresis (DGGE) can detect single base 
mutations based on differences in migration between homo- and heteroduplexes 
(Myers et al., Nature, 313:495-498, 1985). The DNA sample to be tested is 
hybridized to a labeled wild type probe. The duplexes formed are then subjected to 
1 0 electrophoresis through a polyacrylamide gel that contains a gradient of DNA 

denaturant parallel to the direction of electrophoresis. Heteroduplexes formed due to 
single base variations are detected on the basis of differences in migration between 
the heteroduplexes and the homoduplexes formed. 

In heteroduplex analysis (HET) (Keen et ah, Trends Genet.7:5, 1991), 
1 5 genomic DNA is amplified by the polymerase chain reaction followed by an 

additional denaturing step which increases the chance of heteroduplex formation in 
heterozygous individuals. The PCR products are then separated on Hydrolink gels 
where the presence of the heteroduplex is observed as an additional band. 

Chemical cleavage analysis (CCM) is based on the chemical reactivity of 
20 thymine (T) when mismatched with cytosine, guanine or thymine and the chemical 
reactivity of cytosine (C) when mismatched with thymine, adenine or cytosine 
(Cotton et al., Proc. Natl. Acad, Sci. USA, 85:4397-4401, 1988). Duplex DNA 
formed by hybridization of a wild type probe with the DNA to be examined, is 
treated with osmium tetroxide for T and C mismatches and hydroxylamine for C 
25 mismatches. T and C mismatched bases that have reacted with the hydroxylamine 
or osmium tetroxide are then cleaved with piperidine. The cleavage products are 
then analyzed by gel electrophoresis. 

Ribonuclease cleavage involves enzymatic cleavage of RNA at a single base 
mismatch in an RNA: DNA hybrid (Myers et al., Science 230:1242-1246, 1985). A 
30 labeled RNA probe complementary to the wild type DNA is annealed to the test 
DNA and then treated with ribonuclease A. If a mismatch occurs, ribonuclease A 
will cleave the RNA probe and the location of the mismatch can then be determined 
by size analysis of the cleavage products following gel electrophoresis. 
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Detection of Known Polymorphisms 

The second type of polymorphism detection involves determining which 
form of a known polymorphism is present in individuals for diagnostic or 
epidemiological purposes. In addition to the already discussed methods for 
5 detection of polymorphisms, several methods have been developed to detect known 
SNPs. Many of these assays have been reviewed by Landegren et aL, Genome Res,, 
8:769-776, 1998 and will only be briefly reviewed here. 

One type of assay has been termed an array hybridization assay, an example 
of which is the multiplexed allele-specific diagnostic assay (MASDA) (U.S. Patent 

10 No. 5,834,181; Shuber et al., Hum. Molec. Genet. , 6:337-347, 1997). In MASDA, 
samples from multiplex PCR are immobilized on a solid support. A single 
hybridization is conducted with a pool of labeled allele specific oligonucleotides 
(ASO). Any ASOs that hybridize to the samples are removed from the pool of 
ASOs. The support is then washed to remove unhybridized ASOs remaining in the 

1 5 pool. Labeled ASOs remaining on the support are detected and eluted from the 
support. The eluted ASOs are then sequenced to determine the mutation present. 

Two assays depend on hybridization-based allele-discrimination during PCR. 
The TaqMan assay (U.S. Patent No. 5,962,233; Livak et al., Nature Genet., 9:341- 
342, 1995) uses allele specific (ASO) probes with a donor dye on one end and an 

20 acceptor dye on the other end. such that the dye pair interact via fluorescence 

resonance energy transfer (FRET). A target sequence is amplified by PCR modified 
to include the addition of the labeled ASO probe. The PCR conditions are adjusted 
so that a single nucleotide difference will effect binding of the probe. Due to the 5' 
nuclease activity of the Taq polymerase enzyme, a perfectly complementary probe is 

25 cleaved during the PCR while a probe with a single mismatched base is not cleaved. 
Cleavage of the probe dissociates the donor dye from the quenching acceptor dye, 
greatly increasing the donor fluorescence. 

An alternative to the TaqMan assay is the molecular beacons assay (U.S. 
Patent No. 5,925,517; Tyagi et al., Nature Biotech., 16:49-53, 1998). In the 

30 molecular beacons assay, the ASO probes contain complementary sequences 

flanking the target specific species so that a hairpin structure is formed. The loop of 
the hairpin is complimentary to the target sequence while each arm of the hairpin 
contains either donor or acceptor dyes. When not hybridized to a donor sequence, 
the hairpin structure brings the donor and acceptor dye close together thereby 
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extinguishing the donor fluorescence. When hybridized to the specific target 
sequence, however, the donor and acceptor dyes are separated with an increase in 
fluorescence of up to 900 fold. Molecular beacons can be used in conjunction with 
amplification of the target sequence by PCR and provide a method for real time 
5 detection of the presence of target sequences or can be used after amplification. 

High throughput screening for SNPs that affect restriction sites can be 
achieved by Microtiter Array Diagonal Gel Electrophoresis (MADGE) (Day and 
Humphries, Anal. Biochem., 222:389-395, 1994). In this assay restriction fragment 
digested PCR products are loaded onto stackable horizontal gels with the wells 
10 arrayed in a microtiter format. During electrophoresis, the electric field is applied at 
an angle relative to the columns and rows of the wells allowing products from a 
large number of reactions to be resolved. 

Additional assays for SNPs depend on mismatch distinction by polymerases 
and ligases. The polymerization step in PCR places high stringency requirements on 
1 5 correct base pairing of the 3 ' end of the hybridizing primers. This has allowed the 
use of PCR for the rapid detection of single base changes in DNA by using 
specifically designed oligonucleotides in a method variously called PCR 
amplification of specific alleles (PASA) (Sommer et al., Mayo Clin. Proc, 64:1361- 
1372 1989; Sarker et al., Anal Biochem. 1990), allele-specific amplification (ASA), 
20 allele-specific PCR, and amplification refractory mutation system (ARMS) (Newton 
et al., Nuc. Acids Res., 1989; Nichols et al., Genomics, 1989; Wu et al., Proc. Natl. 
Acad. Set USA, 1989). In these methods, an oligonucleotide primer is designed that 
perfectly matches one allele but mismatches the other allele at or near the 3' end. 
This results in the preferential amplification of one allele over the other. By using 
25 three primers that produce two differently sized products, it can be determined 

whether an individual is homozygous or heterozygous for the mutation (Dutton and 
Sommer, BioTechniques,\\ : 700-702, 1991). In another method, termed bi-PASA, 
four primers are used; two outer primers that bind at different distances from the site 
of the SNP and two allele specific inner primers (Liu et al., Genome Res., 7:389-398, 
30 1997). Each of the inner primers have a non-complementary 5' end and form a 
mismatch near the 3' end if the proper allele is not present. Using this system, 
zygosity is determined based on the size and number of PCR products produced. 

The joining by DNA ligases of two oligonucleotides hybridized to a target 
DNA sequence is quite sensitive to mismatches close to the ligation site, especially 
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at the 3' end. This sensitivity has been utilized in the oligonucleotide ligation assay 
(Landegren et al., Science, 241:1077-1080, 1988) and the ligase chain reaction 
(LCR; Barany, Proc. Natl. Acad. Sci. USA, 88:189-193, 1991). In OLA, the 
sequence surrounding the SNP is first amplified by PCR, whereas in LCR, genomic 
5 DNA can by used as a template. 

In one method for mass screening for SNPs based on the OLA, amplified 
DNA templates are analyzed for their ability to serve as templates for ligation 
reactions between labeled oligonucleotide probes (Samotiaki et al., Genomics, 
20:238-242, 1994). In this assay, two allele-specific probes labeled with either of 
10 two lanthanide labels (europium or terbium) compete for ligation to a third biotin 
labeled phosphorylated oligonucleotide and the signals from the allele specific 
oligonucleotides are compared by time-resolved fluorescence. After ligation, the 
oligonucleotides are collected on an avidin-coated 96-pin capture manifold. The 
collected oligonucleotides are then transferred to microtiter wells in which the 
15 europium and terbium ions are released. The fluorescence from the europium ions is 
determined for each well, followed by measurement of the terbium fluorescence. 

In alternative gel-based OLA assays, numerous SNPs can be detected 
simultaneously using multiplex PCR and multiplex ligation (U.S. Patent No. 
5,830,71 1; Day et al., Genomics, 29:152-162, 1995; Grossman et al., Nuc. Acids 
20 Res., 22:4527-4534, 1994). In these assays, allele specific oligonucleotides with 

different markers, for example, fluorescent dyes, are used. The ligation products are 
then analyzed together by electrophoresis on an automatic DNA sequencer 
distinguishing markers by size and alleles by fluorescence. In the assay by 
Grossman et al., 1994, mobility is further modified by the presence of a non- 
25 nucleotide mobility modifier on one of the oligonucleotides. 

A further modification of the ligation assay has been termed the dye-labeled 
oligonucleotide ligation (DOL) assay (U.S. Patent No. 5,945,283; Chen et al., 
Genome Res., 8:549-556, 1998). DOL combines PCR and the oligonucleotide 
ligation reaction in a two-stage thermal cycling sequence with fluorescence 
30 resonance energy transfer (FRET) detection. In the assay, labeled ligation 

oligonucleotides are designed to have annealing temperatures lower than those of the 
amplification primers. After amplification, the temperature is lowered to a 
temperature where the ligation oligonucleotides can anneal and be ligated together. 
This assay requires the use of a thermostable ligase and a thermostable DNA 
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polymerase without 5* nuclease activity. Because FRET occurs only when the 
donor and acceptor dyes are in close proximity, ligation is inferred by the change in 
fluorescence. 

In another method for the detection of SNPs termed minisequencing, the 
5 target-dependent addition by a polymerase of a specific nucleotide immediately 
downstream (3') to a single primer is used to determine which allele is present (U.S 
Patent No. 5,846,710). Using this method, several SNPs can be analyzed in parallel 
by separating locus specific primers on the basis of size via electrophoresis and 
determining allele specific incorporation using labeled nucleotides. 

10 Determination of individual SNPs using solid phase minisequencing has 

been described by Syvanen et al., Am. J. Hum. Genet., 52:46-59, 1993. In this 
method the sequence including the polymorphic site is amplified by PCR using one 
amplification primer which is biotinylated on its 5' end. The biotinylated PCR 
products are captured in streptavidin-coated microtitration wells, the wells washed, 

15 and the captured PCR products denatured. A sequencing primer is then added 
whose 3' end binds immediately prior to the polymorphic site, and the primer is 
elongated by a DNA polymerase with one single labeled dNTP complementary to 
the nucleotide at the polymorphic site. After the elongation reaction, the sequencing 
primer is released and the presence of the labeled nucleotide detected. Alternatively, 
20 dye labeled dideoxynucleoside triphosphates (ddNTPs) can be used in the elongation 
reaction (U.S. Patent No. 5,888,819; Shumaker et al., Human Mut., 7:346-354, 
1996). In this method, incorporation of the ddNTP is determined using an automatic 
gel sequencer. 

Minisequencing has also been adapted for use with microarrays (Shumaker et 
25 al., Human Mut, 7:346-354, 1996). In this case, elongation (extension) primers are 
attached to a solid support such as a glass slide. Methods for construction of 
oligonucleotide arrays are well known to those of ordinary skill in the art and can be 
found, for example, in Nature Genetics, Supply Vol. 21, January, 1999. PCR 
products are spotted on the array and allowed to anneal. The extension (elongation) 
30 reaction is carried out using a polymerase, a labeled dNTP and noncompeting 
ddNTPs. Incorporation of the labeled dNTP is then detected by the appropriate 
means. In a variation of this method suitable for use with multiplex PCR, extension 
is accomplished with the use of the appropriate labeled ddNTP and unlabeled 
ddNTPs (Pastinen et al., Genome Res., 7:606-614, 1997). 
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Solid phase minisequencing has also been used to detect multiple 
polymorphic nucleotides from different templates in an undivided sample (Pastinen 
et al., Clin. Chem., 42:1391-1397, 1996). In this method, biotinylated PCR products 
are captured on the avidin-coated manifold support and rendered single stranded by 
5 alkaline treatment. The manifold is then placed serially in four reaction mixtures 
containing extension primers of varying lengths, a DNA polymerase and a labeled 
ddNTP, and the extension reaction allowed to proceed. The manifolds are inserted 
into the slots of a gel containing formamide which releases the extended primers 
from the template. The extended primers are then identified by size and 

1 0 fluorescence on a sequencing instrument. 

Fluorescence resonance energy transfer (FRET) has been used in 
combination with minisequencing to detect SNPs (U.S. Patent No. 5,945,283; Chen 
et al., Proc. Natl Acad. Set USA, 94:10756-10761, 1997). In this method, the 
extension primers are labeled with a fluorescent dye, for example fluorescein. The 

1 5 ddNTPs used iii primer extension are labeled with an appropriate FRET dye. 

Incorporation of the ddNTPs is determined by changes in fluorescence intensities. 

The above discussion of methods for the detection of SNPs is exemplary 
only an d is not intended to be exhaustive. Those of ordinary skill in the art will be 
able to envision other methods for detection of SNPs that are within the scope and 

20 spirit of the present invention. 

In one embodiment the present invention provides a method for diagnosing a 
genetic predisposition for a disease and in particular, end-stage renal disease and 
hypertension. In this method, a biological sample is obtained from a subject. The 
subject can be a human being or any vertebrate animal. The biological sample must 

25 contain polynucleotides and preferably genomic DNA. Samples that do not contain 
genomic DNA, for example, pure samples of mammalian red blood cells, are not 
suitable for use in the method. The form of the polynucleotide is not critically 
important such that the use of DNA, cDNA, RNA or mRNA is contemplated within 
the scope of the method. The polynucleotide is then analyzed to detect the presence 

30 of a genetic variant where such variant is associated with an increased risk of 

developing a disease, condition or disorder, and in particular end-stage renal disease. 
In one embodiment, the genetic variant is located at one of the polymorphic sites 
contained in Table 2. In another embodiment, the genetic variant is one of the 
variants contained in Table 2 or the complement of any of the variants contained in 
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Table 2. Any method capable of detecting a genetic variant, including any of the 
methods previously discussed, can be used. Suitable methods include, but are not 
limited to, those methods based on sequencing, mini sequencing, hybridization, 
restriction fragment analysis, oligonucleotide ligation, or allele specific PCR. 
5 The present invention is also directed to an isolated nucleic acid sequence of 

at least 10 contiguous nucleotides from SEQ ID NO: 1 or 2, or the complements of 
SEQ ID NO 1 or 2. In one preferred embodiment, the sequence contains at least one 
polymorphic site associated with a disease, and in particular end-stage renal disease. 
In one embodiment, the polymorphic site is selected from the group contained in 
1 0 Table 2. In another embodiment, the polymorphic site contains a genetic variant, 
and in particular, the genetic variants contained in Table 2 or the complements of the 
variants in Table 2. In yet another embodiment, the polymorphic site, which may or 
may not also include a genetic variant, is located at the 3 5 end of the polynucleotide. 
In still another embodiment, the polynucleotide further contains a detectable marker. 
1 5 Suitable markers include, but are not limited to, radioactive labels, such as 
radionuclides, fluorophores or fluorochromes, peptides, enzymes, antigens, 
antibodies, vitamins or steroids. 

The present invention also includes kits for the detection of polymorphisms 
associated with diseases, conditions or disorders, and in particular end-stage renal 
20 disease and hypertension. The kits contain, at a minimum, at least one 

polynucleotide of at least 10 contiguous nucleotides of SEQ ID NO 1 or 2, or the 
complements of SEQ ID NO: 1 or 2. In one embodiment, the polynucleotide 
contains at least one polymorphic site, preferably a polymorphic site selected from 
the group contained in Table 2. Alternatively the 3' end of the polynucleotide is 
25 immediately 5' to a polymorphic site, preferably a polymorphic site contained in 
Table 2. In one embodiment, the polymorphic site contains a genetic variant, 
preferably a genetic variant selected from the group contained in Table 2. In still 
another embodiment, the genetic variant is located at the 3' end of the 
polynucleotide. In yet another embodiment, the polynucleotide of the kit contains a 
30 detectable label. Suitable labels include, but are not limited to, radioactive labels, 
such as radionuclides, fluorophores or fluorochromes, peptides, enzymes, antigens, 
antibodies, vitamins or steroids. 

In addition, the kit may also contain additional materials for detection of the 
polymorphisms. For example, and without limitation, the kits may contain buffer 
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solutions, enzymes, nucleotide triphosphates, and other reagents and materials 
necessary for the detection of genetic polymorphisms. Additionally, the kits may 
contain instructions for conducting analyses of samples for the presence of 
polymorphisms and for interpreting the results obtained. 
5 In yet another embodiment the present invention provides a method for 

designing a treatment regime for a patient having a disease, condition or disorder 
and in particular end stage renal disease and hypertension caused either directly or 
indirectly by the presence of one or more single nucleotide polymorphisms. In this 
method genetic material from a patient, for example, DNA, cDNA, RNA or mRNA 

10 is screened for the presence of one or more SNPs associated with the disease of 
interest. Depending on the type and location of the SNP, a treatment regime is 
designed to counteract the effect of the SNP. 

Alternatively, information gained from analyzing genetic material for the 
presence of polymorphisms can be used to design treatment regimes involving gene 

1 5 therapy. For example, detection of a polymorphism that either affects the expression 
of a gene or results in the production of a mutant protein can be used to design an 
artificial gene to aid in the production of normal, wild type protein or help restore 
normal gene expression. Methods for the construction of polynucleotide sequences 
encoding proteins and their associated regulatory elements are well know to those of 

20 ordinary skill in the art. Once designed, the gene can be placed in the individual by 
any suitable means known in the art (Gene Therapy Technologies, Applications and 
Regulations, Meager, ed., Wiley, 1999; Gene Therapy: Principles and Applications, 
Blankenstein, ed., Birkhauser Verlag, 1999; Jain, Textbook of Gene Therapy, 
Hogrefe and Huber, 1998). 

25 The present invention is also useful in designing prophylactic treatment 

regimes for patients determined to have an increased susceptibility to a disease, 
condition or disorder, and in particular end stage renal disease and hypertension due 
to the presence of one or more single nucleotide polymorphisms. In this 
embodiment, genetic material, such as DNA, cDNA, RNA or mRNA, is obtained 

30 from a patient and screened for the presence of one or more SNPs associated either 
directly or indirectly to a disease, condition, disorder or other pathological condition. 
Based on this information, a treatment regime can be designed to decrease the risk of 
the patient developing the disease. Such treatment can include, but is not limited to, 
surgery, the administration of pharmaceutical compounds or nutritional 
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supplements, and behavioral changes such as improved diet, increased exercise, 
reduced alcohol intake, smoking cessation, etc. 

EXAMPLES 

Positions of the single nucleotide polymorhisms (SNPs) are given according 
5 to the numbering scheme in the GenBank Accession No. J0443 1 for the TGF-bl 
promoter region and Accession No. U37070 for the TGF-bl receptor II promoter 
region. GenBank sequences can be found at http;//www.ncbi.nim.mh.gov/ 

In the following examples, SNPs are written as "wild type nucleotide" -> 
"variant nucleotide." Changes in nucleotide sequences are indicated in bold print 
1 0 and with double underlining. The standard nucleotide abbreviations are used in 

which A=adenine, C=cytosine, G=guanine, T=thymine, M=A or C, R=A or G, W=A 
or T, S=C or G, Y=C or T, K=G or T, V=A or C or G, H=A or C or T; D-A or G or 
T; B=C or G or T; N= A or C or G or T. 

Example 1 

15 Detection of Novel Polymorphisms bv PCR-SSCA 

Leukocytes were obtained from human whole blood collected with EDTA. 
Blood was obtained from a group of 40 patients with ESRD. The group was made 
up of Caucasians and African Americans of both sexes in unequal ratios. 

Genomic DNA was purified from the collected leukocytes using standard 
20 protocols well known to those of ordinary skill in the art of molecular biology 

(Ausubel et al., Short Protocol in Molecular Biology, 3 rd ed., John Wiley and Sons, 
1995; Sambrook et al., Molecular Cloning, Cold Spring Harbor Laboratory Press, 
1989; and Davis et al., Basic Methods in Molecular Biology, Elsevier Science 
Publishing, 1986). One hundred nanograms of purified genomic DNA was used in 
25 each PCR reaction. 

Standard PCR reaction conditions were used. Methods for conducting PCR 
are well known in the art and can be found, for example, in U.S. Patent Nos 
4,965,188, 4,800,159, 4,683,202, and 4,683,195; Ausbel et al., eds., Short Protocols 
in Molecular Biology, 3 rd ed., Wiley, 1995; and Innis et al., eds., PCR Protocols, 
30 Academic Press, 1990. Specific primers used are given in the following examples. 

PCR reactions were carried out in a total volume of 25 ml containing 100 ng 
leukocyte genomic DNA, 10 pmol of each primer, 200 nM deoxynucleotide 



WO 00/491 69 PCT/US00/04251 

34 

triphosphates (dNTPs), 1 U Taq polymerase (Perkin-Elmer), IX PCR buffer (50 mM 
KCL, 10 mM Tris-HCl, pH 8.3, 1.5 mM MgCl 2 , and 0.01% (w/v) gelatin), and 3% 
(v/v) DMSO. After an initial 3 minutes denaturation at 94°C, 35 cycles were carried 
out consisting of 1 minute denaturation at 94°C, 1 minute hybridization at 55 °C, 2 
5 minute extension at 72 °C, followed by a final extension step of 5 minute at 72 °C, 
and 1 minute cooling at 35 °C. Following amplification, 25ml of PCR product were 
mixed with 12.5 ml denatuaring solution (95% (v/v) formamide, 20 mM xylene 
cyanole FF) and were incubated for 5 minutes at 95 °C. Subsequently, samples were 
chilled in an ice-water bath for 5 minutes and loaded onto a 10% (w/v) 

10 polyacrylamide gel (acrylamide:bisacrylamide = 49:1 ; 72 mm X 102 mm X 0.75 
mm, Bio-Rad Mini PROTEAN II), containing 0.5X Tris-buffered EDTA solution. 
Electrophoresis for 1 7-17.5 hours was carried out under two conditions: 30 V at 4 
°C, and 30 V at room temperature (approx. 23 °C). After electrophoresis, gels were 
stained with 0.5 mg/ml ethidium bromide for 8 minutes. 

1 5 Any polymorphic bands detected were cut out of the gel, amplified with the 

same primers as used for that gel's PCR reaction, blunt-end ligated into a vector 
such as pBlueScript II 6 (Stratagene), cloned, and sequenced. Methods for ligation, 
cloning and sequencing are well known in the art and can be found, for example in 
Ausubel et al., Short Protocol in Molecular Biology, 3 rd ed., John Wiley and Sons, 

20 1995; Sambrook et al., Molecular Cloning, Cold Spring Harbor Laboratory Press, 
1989; and Davis et al., Basic Methods in Molecular Biology, Elsevier Science 
Publishing, 1986. 

Example 2 

G to A Substitution at Position 563 of Human TGF-bl Promoter 
25 PCR and sequencing were conducted as described in Example 1 . The sense 

primer was 5 '-TGC ATGGGGACACC ATCT AC AG-3 ' (SEQ ID NO: 3) and the 
antisense primer was 5'-TCTTGACCACTGTGCCATCCTC-3* (SEQ ID NO: 4). 
The 202 nt PCR product spanned positions 42 1 to 622 of the human TGF bl gene 
(SEQ ID NO: 1). A G563~>A substitution mutation (transition) was found at 
30 position 563 with an allele frequency of 33.3% in Caucasian men with ESRD due to 
HTN but not in a control group of 40 Caucasians and African Americans or in 
patients with ESRD due to NIDDM. 
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The polymorphism results in disruption of a potential TCF1 1 homodimer site 
(S'-CTCATNNWNNNNO*) (SEQ ID NO: 5) beginning at position 563 on the (+) 
strand. The indicated G is replaced by an A in this SNP. TCF1 1 sites occur 
relatively frequently: 4.63 matches per 1000 base pairs of random genomic DNA 
5 sequence in vertebrates. 

Other names for TCF1 1 are KCR-F1 and Nrfl. TCF1 1 is a transcriptional 
activator of the basic region-leucine zipper class, abbreviated as "bZIP." It is 
located on chromosome 1 7q22, and is homologous to transcription factors involved 
in Drosophila ("cap 'n collar," or CNC) and C. elegans (skin-1) embryonic 
10 development, as well as NF-E2, which regulates the globin locus in erythroid cells. 
TCF1 1 has not yet been implicated in control of TGF-bl. Disruption of this 
transcriptional regulatory site can result in lower transcription of TGF-bl, which 
runs counter to the currently prevailing notion that TGF-bl overexpression, rather 
than underexpression, is associated with chronic renal failure. 
1 5 The polymorphism also causes disruption of a potential ATF (activating 

transcription factor) site, which consists of the complement of 5'- 
GRNNNACGTC ASNG-3 ' (SEQ ID NO: 6), ending at nucleotide 556 on the (-) 
strand. ATF sites occur relatively rarely: 0.34 times per 1000 base pairs of random 
genomic sequence in vertebrates. Disruption of this site is also expected to result in 
20 decreased transcription of TGF-bl, leading to an expected decrease in the levels of 
TGF-b 1 mRNA and protein in tissues. 

Also disrupted is a potential CREB (cAMP -responsive element binding 
protein) site. Five variations of this site, all centered at this SNP and requiring G563 
for maximal activity, exist. In all of them, the indicated Q is replaced by an A with 
25 this SNP, as follows: 

(1) 5'-NSTKACGTCASN-3' (SEQ ID NO: 7), beginning 

at nucleotide 557; this sequence occurs only 0.09 times per 1000 base pairs of 
random genomic sequence in vertebrates, so its disruption by this SNP is highly 
significant. 

30 (2) 5'-NNTTACK£TCASN-3' (SEQ ID NO: 8), also 

beginning at nucleotide 557; this sequence occurs 0.34 times per 1000 base pairs of 
random genomic sequence in vertebrates, which is also relatively rare. 
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(3) 5'-NNTKACCTCASN-3 (SEQ ID NO: 9), also 
beginning at nucleotide 557; this sequence occurs 0.34 times per 1000 base pairs of 
random genomic sequence in vertebrates, which is also relatively rare. 

(4) 5 '-NNRCCTCANCNN-3 ' (SEQ ID NO: 10), 

5 beginning at nucleotide 559; this sequence occurs less rarely, at 1.12 times per 1000 
base pairs of random genomic sequence in vertebrates. 

(5) 5'-TKACGTCA-3\ also beginning at nucleotide 559; 
this sequence occurs 0.40 times per 1000 base pairs of random genomic sequence in 
vertebrates, which is relatively rare. 

10 There is also disruption of a potential CREBCJUN (cAMP-responsive 

element binding protein/c-Jun heterodimer) binding site, 5'-TRAC<7TCA-3\ 
beginning at nucleotide 559. This sequence occurs relatively rarely in vertebrates, 
0.22 times per 1000 base pairs of random genomic sequence. 

Additionally, the G563~>A SNP disrupts an API (activator protein 1) site, 

15 5'-WNKNACTrCASY-3' (SEQ ID NO: 11), beginning at nucleotide 558 where the 
indicated Q is replaced with an A. This site occurs relatively frequently: 1.82 times 
per 1000 bases of random genomic sequence in vertebrates. 

The G563-- >A SNP appears to be strongly associated with end-stage renal 
disease due to hypertension in Caucasian men as a major allele. This SNP is 

20 predicted to disrupt a number of rare sites for potent transcriptional activators such 
as TCF1 1, CRE binding protein, CREB/c-Jun heterodimer, and API (c-Fos/c-Jun 
heterodimer), as well as more common sites, e.g. for TCF1 1 . The expected result is a 
decrease in TGF-bl transcription, with a resultant decrease in tissue abundance of 
TGF-bl protein. 

25 Example 3 

C to G Substitution at Position 216 of Human TGF-bl Promoter 

PCR and sequencing were conducted as described in Example 1 . The sense 
primer was 5 ' - AAGACGGTGGG AGCCT AG AAAG-3 ' (SEQ ID NO: 12) and the 
antisense primer was S'-TGGGACC AC ACCTGGAAATG-3' (SEQ ID NO: 13). 
30 The PCR product produced spanned positions 66 to 265 of the human TGF-bl gene 
(SEQ ID NO: 1). A C216-- >G substitution mutation (transversion) was found at 
position 216 with an allele frequency of 8.3% in African American women with 
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ESRD due to NIDDM but not in a control group of 40 Caucasians and African 
Americans or in patients with ESRD due to HTN.. 

The polymorph™ results in disruption of a putative FSE2 site (nucleotides 
216 to 224) in the TGF-bl promoter, approximately 2kb upstream (5') of the 
5 transcription initiation site. The TGF-bl promoter has two FSE2 sites; the second 
one is located approximately 600 bases downstream from the first site (at 
nucleotides 807-816). FSE2 sites are potent negative transcriptional regulatory sites; 
disruption of a site is thus expected to result in increased transcription of the TGF-bl 
gene. Assuming that mRNA stability, translational efficiency, etc. are unchanged, 
10 this SNP is expected to result in increased cellular production and secretion of TGF- 
bl . Since several models of chronic renal failure are associated with increased 
expression of TGF-bl, this SNP can account for the increased fibrosis of the renal 
parenchyma observed in chronic renal failure. 

The polymorphism also results in disruption of a potential GKLF (gut- 
15 enriched Krueppel-like factor) site beginning at nucleotide 21 1 according to 

numbering on the (+) strand. The binding site is actually located on the (-) strand, 
and consists of the complement to the sequence 5 '-CCYYT FYYTYNTTY-3 * (SEQ 
ID NO: 14). This SNP replaces the underlined Y (C or T) with a G. GKLF sites 
occur relatively frequently, 4.76 matches per 1000 base pairs of random genomic 
20 sequence in vertebrates. 

GKLF is a transcriptional activator, so disruption of its binding site in the 
TGF-bl promoter should result in a lower rate of TGF-bl transcription, and 
ultimately a lower level of TGF-bl produced in tissues. However, since GKLF has 
not been implicated so far in the regulation of TGF-bl in the kidney, GKLF may 
25 represent a heretofore unrecognized participant in TGF-bl regulation, and hence 
renal failure. 

Example 4 

C to T Substitution at Position 552 of TGF-b-RII Promoter 

PCR and sequencing were conducted as in Example 1 . The sense primer was 
30 5 ' - AAGTC ACTCC AGCTTTGGC AAG-3 5 (SEQ ID NO: 15) and the antisense 
primer was 5'-TTGCAGAGGCAGGGTTTGTG-3' (SEQ It) NO: 16). The PCR 
product produced spanned bases 494-767 of the TGF-b-RII promoter. A C552->T 
substitution mutation (transition) was found at position 552 with an allele frequency 
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of 8.3% in African American women with ESRD due to NIDDM, but not in the 
reference sequence (SEQ ID NO:2) or patients with ESRD due to HTN. 

The polymorphim results in disruption of a potential CETS1P54 [c-Ets- 
l(p54)] site, which consists of the complement of S'-NRCWTCCKGNO' (SEQ ID 
5 NO: 17) beginning at position 546. The site is located on the (-) strand. CETS1P54 
sites occur relatively frequently: 1.53 matches per 1000 base pairs of random 
genomic sequence in vertebrates. Since c-Ets-l(p54) is a transcriptional activator, 
disruption of its binding site is expected to result in a decreased rate of transcription 
of the TGFb-RII gene. If the rate of translation is tied to the level of messenger 

10 RNA, as is the case for most proteins, then less gene product (TGFb-RII) will be the 
result. TGF-bl binds to TGFb-RII at the cell surface. TGFb-RII is required for 
specific recognition of the TGF-bl ligand. Once the ligand is bound, TGFb-RII then 
recruits TGFb-RI which is already present (and constitutiveiy active) in the 
membrane. The heterodimer transduces the TGF-bl signal to the cell's nucleus 

15 through additional components such as the Smad proteins, 

Decreased TGFb-RII production is expected to decrease signalling by TGF- 
bl. How decreased (rather than increased) TGF-bl signalling leads to ESRD is 
unclear, and contradicts our current understanding of the pathogenesis of chronic 
renal failure. 



20 Example 5 

Deletion at Position 607 of TGF -b-RII Promoter 
PCR and sequencing were conducted as in Example 1 . The PCR primers 
used were the same as in Example 4. A deletion polymorphism in which one of the 
As at positions 607 and 608 is eliminated was found with an allele frequency of 
25 1 6.7% in African American women with ESRD due to NIDDM, but not in the 
reference sequence (SEQ ED NO: 2) or in patients with ESRD due to HTN. 

The polymorphism results in disruption of a potential SOX5 binding site, 
which consists of the sequence 5'-NNAAG4ATNN-3' (SEQ ID NO: 18), beginning 
at nucleotide 602 on the (+) strand. The A607del polymorphism results in deletion 
30 of the indicated A. SOX5 is a member of a large family of transcription factors 
which bind the so-called "HMG" box, AACAAT. SOX stands for "SRY-related 
HMG box." SOX5 binding sites occur fairly frequently in vertebrate genomic DNA 
at 1.10 sites per 1 000 bases of random genomic sequence in vertebrates. The SOX 
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proteins have been implicated in a variety of developmental processes, such as 
chondrogenesis, but not in the progression of chronic renal failure. Since SOX5 is a 
transcriptional activator, disruption of its binding site is expected to result in 
decreased TGFb-RII gene expression. 
5 The polymorphism also causes disruption of a potential TH1E47 

(Thing 1/E47 heterodimer) binding site, consisting of the sequence complementary to 
5 '-A^WKCCAGAYNCNNNN-3 ' (SEQ ID NO: 19) on the (-) strand. The 
indicated A is deleted by this SNP. TH1E47 binding sites occur relatively 
frequently, 2.04 sites per 1000 bases of random genomic DNA in vertebrates. 
1 0 Disruption of this site is similarly expected to result in decreased TGFb-RII gene 
expression. 

Example 6 

Deletion at Position 619 of TGF-b-RII Promoter 
PCR and sequencing were conducted as in Example 1 . The PCR primers 
1 5 used were the same as in Example 4. A deletion polymorphism in which one of the 
Gs at positions 619 and 620 is eliminated was found with an allele frequency of 
16.7% in African American women with ESRD due to NTDDM, but not in the 
reference sequence (SEQ ID NO: 2) or in patients with ESRD sue to HTN. 

The G619del SNP disrupts the TH1E47 (Thingl/E47 heterodimer) binding 
20 site, which consists of the sequence complementary to 5'- 

AAWKCCAGAYNCNNNN-3 * (SEQ ID NO: 20). This sequence begins at 
nucleotide 606 on the (+) strand. The TH1E47 binding site is located on the (-) 
strand. The indicated N is deleted by this SNP, but it is expected to make no 
difference in binding by TH1E47. TH1E47 binding sites occur relatively frequently, 
25 2.04 sites per 1 000 bases of random genomic DNA in vertebrates. 

Example 7 

Insertion at Position 757 of TGF-b-RII Promoter 
PCR and sequencing were conducted as in Example 1 . The sense primer was 
5 * -GG AGTTGGGTTTGGGGG AG-3 ' (SEQ ID NO: 21) and the antisense primer 
30 was 5 '-TCTTGCTAGGGCAACCAGATTTG-3 ' (SEQ ID NO: 22). The PCR 
product produced spanned bases 697 to 988 of the TGF-b-RII promoter (SEQ ID 
NO: 2). An insertion polymorphism in which an additional T is inserted at position 
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757 was discovered with an allele frequency of 8.3% in African American women 
with ESRD due to NIDDM and 33.3% in African American women with ESRD due 
to HTN, but not in the reference sequence (SEQ ID NO: 2). 

Insertion of a T after nucleotide 757 occurs in a potential NF-1 (nuclear 
5 factor- 1) site on the (+) strand, consisting of the sequence 

5 ' -NNTTGGCNNNNNNCC^NN-3 ' (SEQ ID NO: 23), beginning at position 742. 
T is inserted after the indicated nucleotide N . NF1 binding sites occur rather 
frequently, 4.1 1 times per 1000 base pairs of random genomic sequence in 
vertebrates. However, this insertion is not expected to disrupt binding by NF-1, as 
1 0 there is no stringency for the final three nucleotides. 

Example 8 

A to G Substitution at Position 792 of TGF-b-RII Promoter 
PCR and sequencing were conducted as in Example 1 . The primers were the 
same as in Example 7. An A to G substitution polymorphism (transition) was 

1 5 discovered at position 792 with an allele frequency of 4.2% in African American 
women with ESRD due to NIDDM and 1 6.7% in African American women with 
ESRD due to HTN, but not in the reference sequence (SEQ ID NO: 2). 

The polymorphism results in disruption of an S 8 binding site, which consists 
of the sequence 5 '-NNNNNYA^TTAN-3 ' (SEQ ID NO: 24), beginning at position 

20 784 on the (+) strand. This polymorphism replaces the indicated A with a G, which 
should result in a weaker binding site for S8, a transcriptional activator. S8 binding 
sites occur rather frequently, 1.73 times per 1000 base pairs of random genomic 
sequence in vertebrates. 

S8 is a homeobox protein, also called Prx2, which operates at the junction of 

25 epithelial and mesenchymal tissues. It is a transcription factor active during 

organogenesis. During development of the kidney, ureteral ectoderm (i.e. epithelial 
cells) induce surrounding mesoderm (or mesenchyme) to differentiate into proximal 
and distal tubules. It has been suggested that developmental pathways operating 
during embryonic life might be reactivated during disease processes. 

30 Since S8 is a positive transcriptional regulator, disruption of its binding site 

is expected to result in a decreased rate of transcription of the TGFb-RII gene. If the 
rate of translation is tied to the level of messenger RNA, as is the case for most 
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proteins, then less gene product (TGFb-RII protein) will be the result, ultimately 
leading to less TGFbl signalling in tissues. 

The polymorphism also results in disruption of a TCF1 1 (TCF1 1/KCR- 
Fl/Nrfl homodimer) binding site, which consists of the sequence 5 5 - 
5 GTC4TNNWNNNNN-3 * (SEQ ID NO: 25), beginning at position 789 on the (+) 
strand. This polymorphism replaces the indicated A with a G, which should result in 
a weaker binding site for TCF1 1 , a transcriptional activator. TCF1 1 binding sites 
occur moderately frequently, at the rate of 4.63 sites per 1000 base pairs of random 
genomic sequence in vertebrates. Since TCF1 1 is a transcriptional activator, 

10 disruption of its binding site is expected to result in a decreased rate of transcription 
of the TGFb-RII gene. If the rate of translation is tied to the level of messenger 
RNA, as is the case for most proteins, then less gene product (TGFb-RII protein) 
will be the result, ultimately leading to less TGFbl signalling in tissues. 

The A792->G SNP appears to be a relatively important allele for ESRD due 

1 5 to hypertension in African American women. 

Example 9 

G to A Substitution at Position 1009 of TGF-b-RII Promoter 
PGR and sequencing were conducted as in Example 1. The sense primer was 
5 '-GGACATATCTGAAAGAGAAAGGGGG-3 ' (SEQ ID NO: 26) and the 

20 antisense primer was 5'-TTGGGAGTCACCTGAATGCTTG-3' (SEQ ID NO: 27). 
The PCR product produced spanned bases 892 to 1113 of the TGF-b-RII promoter. 
A Gl 009->A substitution polymorphism (transition) was discovered at position 
1009. This polymorphism had an allele frequency of 37.5% in members of a control 
group of 20 Caucasians and African Americans In patients with ESRD due to 

25 NIDDM, the polymorphism had an allele frequency of 25% in Caucasian men, 

12.5% in Caucasian women, 33.3% in African American men, and 58.3% in African 
American women. In patients with ESRD due to HTN, the polymorphism had an 
allele frequency of 16.7% in Caucasian men, 25% in Caucasian women, 40% in 
African American men, and 50% in African American women. 

30 The G1009-- >A transition is quite frequent in the control population, with an 

allele frequency of 37.5%. It appears to occur at roughly the same frequency in 
African American men with ESRD due to hypertension (40%) or NIDDM (33%), 
but at a significantly lower frequency in both male and female Caucasians with 
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ESRD due to hypertension or NIDDM. It appears to occur at a significantly higher 
frequency among African American women with ESRD due to either hypertension 
(50%) or NIDDM (58%). 

These data would suggest that the G1009->A SNP is protective for 
5 Caucasians with NIDDM or hypertension, neutral for African American men with 
NIDDM or hypertension, but harmful for African American women with NIDDM or 
hypertension. The mechanism for such a complex effect is unknown, but suggests 
the involvement of additional modifier genes that differ in frequency according to 
gender and ethnicity. 

10 A comparison of the allele frequency of the G1009~>A SNP was also made 

between a group of 28 Caucasian males with HTN, but not ESRD, and a control 
group of 26 Caucasian males with neither HTN nor ESRD. The allele frequency for 
the G1009-->A SNP was 12.5% in the group with HTN as compared to 34.6% in the 
control group. These data indicate that the G1009-->A SNP is protective for the 

1 5 development of HTN. 



Table 2 



Gene 


Region 


Location 


Wild Type 


Variant 


SEQID 


TGF-bl 


Promoter 


216 


C 


G 


1 






563 


G 


A 


1 


TGF-b-RII 


Promoter 


552 


C 


T 


2 






607-608 


AA 


A 


2 






619-620 


GG 


G 


2 






757 


T 


TT 


2 






792 


A 


G 


2 






1009 


G 


A 


2 



WO 00/491 69 PCT/USOO/04251 

43 

What is claimed is: 

1 . A method for diagnosing a genetic susceptibility for end-stage renal disease in a 
subject comprising: 

obtaining a biological sample containing at least one polynucleotide from said 
subject; and 

5 analyzing said polynucleotide to detect a genetic polymorphism wherein said 
genetic polymorphism is associated with an altered susceptibility for end-stage 
renal disease. 

2. The method of claim 1 , wherein said polynucleotide is DNA, RNA, cDNA or 
mRNA. 

3. The method of claim 1, wherein said polymorphism is located in a gene selected 
from the group consisting of the genes in Table L 

4. The method of claim 1, wherein said polymorphism is located in the TGF-bl 
gene or the TGF-b-RII gene. 

5. The method of claim 1, wherein said genetic polymorphism is located at a site 
selected from the group consisting of the polymorphic sites in Table 2. 

6. The method of claim 1 , wherein said genetic polymorphism is a genetic variant 
selected from the group consisting of the variants in Table 2. 

7. The method of claim 1, wherein said genetic polymorphism is a genetic variant 
selected from the group consisting of the complements of the genetic variants in 
Table 2. 

8. The method of claim 1, wherein said analysis is accomplished by sequencing, 
mini sequencing, hybridization, restriction fragment analysis, oligonucleotide 
ligation assay or allele specific PCR. 
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9. An isolated nucleic acid sequence comprising at least 10 contiguous nucleotides 
from SEQ ID NO: 1 or 2, wherein said sequence contains at least one variant 
associated with end stage renal disease so that said sequence differs from SEQ 
ID NO: 1 or 2. 

10. The isolated nucleic acid sequence of claim 9, wherein said variant is located at a 
site selected from the group consisting of the sites in Table 2. 

1 1 . The isolated nucleic acid sequence of claim 9, wherein said variant is selected 
from the group consisting of the variants in Table 2. 

12. The isolated nucleic acid sequence of claim 9, wherein said variant is selected 
from the group consisting of the complements of the variants in Table 2. 

13. The isolated sequence of claim 9, wherein said variant is located at the 3' end of 
said nucleic acid sequence. 

14. The isolated nucleic acid sequence of claim 9, further comprising a detectable 
label. 

15. The isolated nucleic acid sequence of claim 14, wherein said detectable label is 
selected from the group consisting of radionuclides, fluorophores or 
fluorochromes, peptides, enzymes, antigens, antibodies, vitamins or steroids. 

16. A kit for the detection of a polymorphism comprising at least one polynucleotide 
of at least 1 0 contiguous nucleotides from SEQ ID NO 1 or 2, wherein said 
polynucleotide contains at least one variant associated with end stage renal 
disease; a means for detecting the presence of said polymorphism; and 

5 instructions for detecting said polymorphism. 

17. The kit of claim 16, wherein said variant is located at a site selected from the 
group consisting of the polymorphic sites in Table 2. 
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18. The kit of claim 16, wherein said variant is selected from the group consisting of 
the variants in Table 2. 

19. The kit of claim 16, wherein said variant is selected from the group consisting of 
the complements of the variants in Table 2. 

20. The kit of claim 16, wherein said variant is locatred at the 3' end of said 
polynucleotide. 

21. The kit of claim 16, wherein said polynucleotide further comprises a detectable 
label. 



22. The kit of claim 21, wherein said detectable label is chosen from the group 
consisting of radionuclides, fluorophores or fluorochromes, peptides enzymes, 
antigens, antibodies, vitamins or steroids. 

23. A kit for the detection of a polymorphism comprising at least one polynucleotide 
of at least 10 contiguous nucleotides of SEQ ID NO: 1 or 2 or the complement 
thereof; wherein the 3' end of said polynucleotide is immediately 5' to a genetic 
variant chosen from the group consisting of the variants in Table 2 and the 

5 complements thereof 

24. The kit of claim 23, wherein said polynucleotide further comprises a detectable 
label. 

25. The kit of claim 24, wherein said detectable label is chosen from the group 
consisting of radionuclides, fluorophores or fluorochromes, peptides, enzymes, 
antigens, antibodies, vitamins or steroids. 

26. A method for treatment or prophylaxis in a subject with a polymorphism 
associated with end stage renal disease comprising: 

obtaining a sample of biological material containing at least one polynucleotide 
5 from a subject; 



WO 00/491 69 PCT/US00/04251 

46 

analyzing said polynucleotide to detect the presence of at least one 
polymorphism associated with end stage renal disease; and 
treating said subject. 

27. The method of claim 26 wherein said polynucleotide is selected from the group 
consisting of DNA, cDNA, RNA and mRNA. 

28. The method of claim 26, wherein said polymorphsim is located in a gene 
selected from the group consisting of the genes in Table 1 . 

29. The method of claim 26, wherein said polymorphism is located in the TGF-bl 
gene or the TGF-b-RII gene. 

30. The method of claim 26, wherein said polymorphism is a variant selected from 
the group consisting of the variants in Table 2 or the complements thereof. 

3 1 . The method of claim 26 wherein said treatment counteracts the effect of the 
polymorphism. 

32. A method for diagnosing a genetic susceptibility for hypertension in a subject 
comprising: 

obtaining a biological sample containing at least one polynucleotide from said 
subject; and 

5 analyzing said polynucleotide to detect a genetic polymorphism wherein said 
genetic polymorphism is associated with an altered susceptibility for end-stage 
renal disease. 

33. The method of claim 32, wherein said polynucleotide is DNA, RNA, cDNA or 
mRNA. 

34. The method of claim 32, wherein said polymorphism is located in a gene 
selected from the group consisting of the genes in Table 1. 
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35. The method of claim 32, wherein said polymorphism is located in the TGF-b-RII 
gene. 

36. The method of claim 35, wherein said genetic polymorphism is located at 
position 1009. 

37. The method of claim 36, wherein said genetic polymorphism is a G to A 
substitution. 

38. The method of claim 36, wherein said genetic polymorphism is a C to T 
substitution. 

39. The method of claim 32, wherein said analysis is accomplished by sequencing, 
mini sequencing, hybridization, restriction fragment analysis, oligonucleotide 
ligation assay or allele specific PCR. 

40. A method for treatment or prophylaxis in a subject with a polymorphism 
associated with hypertension comprising: 

obtaining a sample of biological material containing at least one polynucleotide 
from a subject; 

5 analyzing said polynucleotide to detect the presence of at least one 
polymorphism associated with end stage renal disease; and 
treating said subject. 

41. The method of claim 40 wherein said polynucleotide is selected from the group 
consisting of DNA, cDNA, RNA and mRNA 

42. The method of claim 40, wherein said polymorphsim is located in a gene 
selected from the group consisting of the genes in Table 1. 

43. The method of claim 40, wherein said polymorophism is located in the TGF-b- 
RII gene. 

44. The method of claim 43 wherein said polymorphism is located at position 1009. 
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45. The method of claim 44, wherein said polymorphism is a G to A substitution 

46. The method of claim 44, wherein said polymorphism is a C to T substitution. 

47. The method of claim 40 wherein said treatment counteracts the effect of the 
polymorphism. 
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FIGURE 1 

1 ggatccttag caggggagta acatggattt ggaaagatca ctttggctgc tgtgtgggga 
61 tagataagac ggtgggagcc tagaaaggag gctgggttgg aaactctggg acagaaaccc 
121 agagaggaaa agactgggcc tggggtctcc agtgagtatc agggagtggg gaatcagcag 
181 gagtctggtc cccacccatc cctcctttcc cctctctctc ctttcctgca ggctggcccc 
241 ggctccattt ccaggtgtgg tcccaggaca gctttggccg ctgccagctt gcaggctatg 

3 01 gattttgcca tgtgcccagt agcccgggca cccaccagct ggcctgcccc acgtggcggc 
361 ccctgggcag ttggcgagaa cagttggcac gggctttcgt gggtggtggg ccgcagctgc 

4 21 tgcatgggga caccatctac agtggggccg accgctatcg cctgcacaca gctgctggtg 
4 81 gcaccgtgca cctggagatc ggcctgctgc tccgcaactt cgaccgctac ggcgtggagt 
541 gctgagggac tctgcctcca acgtcaccac catccacacc ccggacaccc agtgatgggg 
6 01 gaggatggca cagtggtcaa gagcacagac tctagagact gtcagagctg accccagcta 
661 aggcatggca ccgcttctgt cctttctagg acctcggggt ccctctgggc ccagtttccc 
721 tatctgtaaa ttggggacag taaatgtatg gggtcgcagg gtgttgagtg acaggaggct 
781 gcttagccac atgggaggtg ctcagtaaag gagagcaatt cttacaggtg tctgcctcct 
841 gacccttcca tccctcaggt gtcctgttgc cccctcctcc cactgacacc ctccggaggc 
901 ccccatgttg acagaccctc cttctcctac cttgtttccc agcctgactc tccttccgtt 
961 ctgggtcccc ctcctctggt cggctcccct gtgtctcatc ccccggatta agccttctcc 

1021 gcctggtcct ctttctctgg tgacccacac cgcccgcaaa gccacagcgc atctggatca 
1081 cccgctttgg tggcgcttgg ccgccaggag gcagcaccct gtttgcgggg cggagccggg 
1141 gagcccgccc cctttccccc agggctgaag ggacccccct cggagcccgc ccacgcgaga 
1201 tgaggacggt ggcccagccc ccccatgccc tccccctggg ggccgccccc gctcccgccc 
1261 cgtgcgcttc ctgggtgggg ccgggggcgg cttcaaaacc ccctgccgac ccagccggtc 
13 21 cccgccgccg ccgcccttcg cgccctgggc catctccctc ccacctccct ccgcggagca 
13 81 gccagacagc gagggccccg gccgggggca ggggggacgc cccgtccggg gcaccccccc 
1441 ggctctgagc cgcccgcggg gccggcctcg gcccggagcg gaggaaggag tcgccgagga 
1501 gcagcctgag gccccagagt ctgagacgag ccgccgccgc ccccgccact gcggggagga 
1561 gggggaggag gagcgggagg agggacgagc tggtcgggag aagaggaaaa aaacttttga 
1621 gacttttccg ttgccgctgg gagccggagg cgcggggacc tcttggcgcg acgctgcccc 
16 81 gcgaggaggc aggacttggg gaccccagac cgcctccctt tgccgccggg gacgcttgct 
1741. ccctccctgc cccctacacg gcgtccctca ggcgccccca ttccggacca gccctcggga 
1801 gtcgccgacc cggcctcccg caaagacttt tccccagacc tcgggcgcac cccctgcacg 
1861 ccgccttcat ccccggcctg tctcctgagc ccccgcgcat cctagaccct ttctcctcca 
1921 ggagacggat ctctctccga cctgccacag atcccctatt caagaccacc caccttctgg 
1981 taccagatcg cgcccatcta ggttatttcc gtgggatact gagacacccc cggtccaagc 
2041 ctcccctcca ccactgcgcc cttctccctg aggagcctca gctttccctc gaggccctcc 
2101 taccttttgc cgggagaccc ccagcccctg caggggcggg gcctccccac cacaccagcc 
2161 ctgttcgcgc tctcggcagt gccggggggc gccgcctccc ccatg 
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FIGURE 2 
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cccatcaaag 


aagttatgat 


tcaatccacg 


aagaccagga 


gttggcgaaa 


tgaagaaaaa 
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acatgagtac 


tgggaagtgg 


601 


caaaacaatc 
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cttgggtgct 


aggtggagca 


tgagttaaaa 


ttccaggatg 


661 
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atgagagggt 
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gaataaaagc 


acaagcattc 


aggtgactcc 


caacccactt 


ttaattttac 


agtttctgct 
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atactctata 
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attacatttc 
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acttegtgat 


aggtgatcat 
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agaaatgcag 
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1741 


gggcggtgag 


gggcagctga 
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aagctctegg 
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ctaggaaaca 


1801 


tgattggcag 


ctacgagaga 


gctaggggct 


ggaegtcgag 


gagagggaga 


aggctctegg 


1861 


gcggagagag 


gtcctgccca 


get 
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SEQUENCE LISTING 

<1 10> Moskowitz, David W 

<120> DIAGNOSTIC POLYMORPHISMS 

<130>DZG 2176 

<140> 
<141> 

<150> 60/120,787 
<151> 1999-02-19 

<160>27 

<170> Patentln Ver. 2. 1 

<210> 1 

<211>2205 

<212>DNA 

<213> Homo sapiens 

<400> 1 

ggatccttag caggggagta acatggattt ggaaagatca ctttggctgc tgtgtgggga 60 
tagataagac ggtgggagcc tagaaaggag gctgggttgg aaactctggg acagaaaccc 120 
agagaggaaa agactgggcc tggggtctcc agtgagtatc agggagtggg gaatcagcag 180 
gagtctggtc cccacccatc cctcctttcc cctctctctc ctttcctgca ggctggcccc 240 
ggctccattt ccaggtgtgg tcccaggaca gctttggccg ctgccagctt gcaggctatg 300 
gattttgcca tgtgcccagt agcccgggca cccaccagct ggcctgcccc acgtggcggc 360 
ccctgggcag ttggcgagaa cagttggcac gggctttcgt gggtggtggg ccgcagctgc 420 
tgcatgggga caccatctac agtggggccg accgctatcg cctgcacaca gctgctggtg 480 
gcaccgtgca cctggagatc ggcctgctgc tccgcaactt cgaccgctac ggcgtggagt 540 
gctgagggac tctgcctcca acgtcaccac catccacacc ccggacaccc agtgatgggg 600 
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gaggatggca cagtggtcaa gagcacagac tctagagact gtcagagctg accccagcta 660 
aggcatggca ccgcttctgt cctttctagg acctcggggt ccctctgggc ccagtttccc 720 
tatctgtaaa ttggggacag taaatgtatg gggtcgcagg gtgttgagtg acaggaggct 780 
gcttagccac atgggaggtg ctcagtaaag gagagcaatt cttacaggtg tctgcctcct 840 
5 gacccttcca tccctcaggt gtcctgttgc cccctcctcc cactgacacc ctccggaggc 900 

ccccatgttg acagaccctc cttctcctac cttgtttccc agcctgactc tccttccgtt 960 
ctgggtcccc ctcctctggt cggctcccct gtgtctcatc ccccggatta agccttctcc 1020 
gcctggtcct ctttctctgg tgacccacac cgcccgcaaa gccacagcgc atctggatca 1080 
cccgctttgg tggcgcttgg ccgccaggag gcagcaccct gtttgcgggg cggagccggg 1 140 

10 gagcccgccc cctttccccc agggctgaag ggacccccct cggagcccgc ccacgcgaga 1200 

tgaggacggt ggcccagccc ccccatgccc tccccctggg ggccgccccc gctcccgccc 1260 
cgtgcgcttc ctgggtgggg ccgggggcgg cttcaaaacc ccctgccgac ccagccggtc 1320 
cccgccgccg ccgcccttcg cgccctgggc catctccctc ccacctccct ccgcggagca 1380 
gccagacagc gagggccccg gccgggggca ggggggacgc cccgtccggg gcaccccccc 1440 

1 5 ggctctgagc cgcccgcggg gccggcctcg gcccggagcg gaggaaggag tcgccgagga 1 500 

gcagcctgag gccccagagt ctgagacgag ccgccgccgc ccccgccact gcggggagga 1560 
gggggaggag gagcgggagg agggacgagc tggtcgggag aagaggaaaa aaacttttga 1620 
gacttttccg ttgccgctgg gagccggagg cgcggggacc tcttggcgcg acgctgcccc 1680 
gcgaggaggc aggacttggg gaccccagac cgcctccctt tgccgccggg gacgcttgct 1740 

20 ccctccctgc cccctacacg gcgtccctca ggcgccccca ttccggacca gccctcggga 1 800 

gtcgccgacc cggcctcccg caaagacttt tccccagacc tcgggcgcac cccctgcacg 1 860 
ccgccttcat ccccggcctg tctcctgagc ccccgcgcat cctagaccct ttctcctcca 1 920 
ggagacggat ctctctccga cctgccacag atcccctatt caagaccacc caccttctgg 1980 
taccagatcg cgcccatcta ggttatttcc gtgggatact gagacacccc cggtccaagc 2040 

25 ctcccctcca ccactgcgcc cttctccctg aggagcctca gctttccctc gaggccctcc 2100 

taccttttgc cgggagaccc ccagcccctg caggggcggg gcctccccac cacaccagcc 2160 
ctgttcgcgc tctcggcagt gccggggggc gccgcctccc ccatg 2205 



<210>2 
<211> 1883 
30 <212>DNA 

<213> Homo sapiens 
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<400> 2 

cccatcaaag aagttatgat tcaatccacg aagaccagga gttggcgaaa tgaagaaaaa 60 
aaggtcagag gaaggaagtc ctctctgggg aaggctctaa gcataaaggg caggaggatt 120 
acagaggcat atctcgaaat ttggagaagg ctttcagtaa gcaaggagaa gccaaatgaa 180 
agtttacgga gagttggagg cttgaagaca ccgttcaagg atctggtttt tatcttctct 240 
ttattctcaa gagcttagtg ggaagccatt aaatgatttt aatcaaggag gggttggtta 300 
taaactagtt ttgttaattt tgaaaaatct gaattcactc tcgtttgaga aactgagtga 360 
aagagcccag aacggccgtg ctgagggtga ctcctgggaa gactccttaa ccacaagcca 420 
tggcagtggc atgggctggt ggcagaagag ggaataggga gaagatttgg aactcaatct 480 
tcctccattg acaaagtcac tccagctttg gcaaggcaat taattggtgg gaaagaagat 540 
gcctagccct cctgatttca ctgcactttc tgcatcttca acatgagtac tgggaagtgg 600 
caaaacaatc cagaggcagg cttgggtgct aggtggagca tgagttaaaa ttccaggatg 660 
aagcaaatga acacttagaa tgacaggaaa gatttgggag ttgggtttgg gggagggcta 720 
tttaccttta ttccctggag accctggcac aaaccctgcc tctgcaatct tcctctcagg 780 
taaaggaatt cattaaatga attgctagaa gatctactga ccagagggct gtacagaatc 840 
atatctttga gagtgggaag taggttgatc acatagttta ttatccaatc aggacatatc 900 
tgaaagagaa agggggttct attaatattt aaactacaaa acatgtacac caggaatgtc 960 
ttgggcaaat ctggttgccc tagcaagaaa ggaaatttga aagtttatgc tgttctgctc 1020 
ccatgttacc ccgtttgcac atgagagggt aagtattctc tttcttcacc tgcattaagg 1080 
gaataaaagc acaagcattc aggtgactcc caacccactt ttaattttac agtttctgct 1 140 
atactctata cattctgaaa attacatttc ccaccactat acttcgtgat aggtgatcat 1200 
ttacaattac tcactgactc agtcccggga agaggcggtg caaaatggac gctctatcca 1260 
ggtgctcatt agaaatgcag aatctctgcc tgcctcctag acctactgaa ttagaatctg 1320 
catttttaaa taagatttcc aggtgatcaa tatgtacatt aaaacttgag aaaaacctct 1380 
agacttcgac ctaaagaaaa acattttaca acttgacagt gtatgcacat acatacatgc 1440 
atatagacac aactgaagca caaatttaat gaagtagaat ttaccgttac tattttattt 1500 
ggaaagaaat gtgctcgcga ctcaatagat tggagtattc actcctggat ctcaacttgc 1560 
aatttgaaaa cgcatctcta aagcacctag gagcaatctg aagaaagctg aggggaggcg 1620 
gcagatgttc tgatctacta gggaaaacgt ggacgttttc tgttgttact ttgtgaactg 1680 
tgtgcactta gtcattcttg agtaaatact tggagcgagg aactcctgag tggtgtggga 1740 
gggcggtgag gggcagctga aagtcggcca aagctctcgg aggggctggt ctaggaaaca 1800 
tgattggcag ctacgagaga gctaggggct ggacgtcgag gagagggaga aagctctcgg 1 860 
gcggagagag gtcctgccca get 1 883 
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<210>3 
<211>22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 3 

tgcatgggga caccatctac ag 22 

<210>4 
<21 1> 22 
<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 4 

tcttgaccac tgtgccatcc tc 22 

<210>5 
<211> 12 
<212> DNA 

<21 3> Unknown Organism 
<220> 

<221> mutation 
<222>(1) 

<223> G is replaced by A 
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<220> 

<221> unsure 
<222> (6).. (7) 

<223>N=A orGorCorT 
<220> 

<221> unsure 
<222> (9)..(12) 
<223>N=A orGorCorT 

<220> 

<223> Description of Unknown Organism: potential TCFII 
homodimer 

<400> 5 

gtcatnnwnnnn 12 

<210>6 
<211> 14 
<212>DNA 

<213> Unknown Organism 
<220> 

<221> unsure 
<222> (3)..(5) 

<223>N=A orGorCorT 
<220> 

<221> mutation 
<222> (8) 

<223> G is replaced by A 



<220> 
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<221> unsure 
<222>(13) 

<223>N=A orGorCorT 
<220> 

<223> Description of Unknown Organism: potential 
activating transcription factor (ATF) site 

<400> 6 

grnnnacgtc asng 14 



<210>7 

<211>12 

<212>DNA 

<213> Unknown Organism 
<220> 

<221> unsure 
<222>(1) 

<223>N=A orGorCorT 
<220> 

<221> mutation 
<222> (7) 

<223> G is replaced by A 
<220> 

<221> unsure 
<222>(12) 

<223> N= A or G or C or T 
<220> 

<223> Description of Unknown Organism: potential cAMP- 
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responsive element binding protein (CREB) site 
<400> 7 

nstkacgtcasn 12 

<210>8 
<211> 13 
<212>DNA 

<213> Unknown Organism 
<220> 

<221> unsure 
<222>(1)..(2) 
<223>N=A orGorCorT 

<220> 

<22 1 > unsure 
<222> (8) 

<223> G is replaced by A 
<220> 

<221> unsure 
<222>(13) 

<223>N=A orGorCorT 
<220> 

<223> Description of Unknown Organism: potential cAMP- 
responsive element binding protein (CREB) site 



<400> 8 
nnttackgtc asn 



13 
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<210>9 
<211> 12 
<212> DNA 

<213> Unknown Organism 
<220> 

<221> unsure 
<222>(1)..(2) 
<223>N=A orGorCorT 

<220> 

<221> mutation 
<222> (7) 

<223> G is replaced with A 
<220> 

<221> unsure 
<222>(12) 

<223>N=A orGorCorT 
<220> 

<223> Description of Unknown Organism: potential cAMP- 
responsive element binding protein (CREB) site 

<400> 9 

nntkacgtca sn 12 

<210> 10 
<211> 12 
<212>DNA 

<213> Unknown Organism 



<220> 
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<221> unsure 
<222>(1)..(2) 

<223> N= A or G or C or T 
<220> 

<221> mutation 
<222> (5) 

<223> G is replaced with A 
<220> 

<223> Description of Unknown Organism: potential cAMP- 
responsive element binding protein (CREB) site 

<400>10 

nnrcgtcancnn 12 

<210>11 
<211>11 
<212>DNA 

<2 1 3> Unknown Organism 
<220> 

<221> unsure 
<222> (2) 

<223>N=A orGorCorT 
<220> 

<221> unsure 
<222>(4) 

<223> N= A or G or C or T 
<220> 

<221> mutation 
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<222> (6) 

<223> G is replaced with A 
<220> 

<223> Description of Unknown Organism: activator protein 
1 (API) site 

<400> 1 1 

wnknagtcas y 11 

<210> 12 
<211>22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 12 

aagacggtgg gagcctagaa ag 22 

<210> 13 
<211>20 
<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 13 

tgggaccaca cctggaaatg 



20 
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<210> 14 
<211> 14 
<212>DNA 

<213> Unknown Organism 
<220> 

<221> mutation 
<222>(6) 

<223> Y (C or T) is replaced with G 
<220> 

<223> Description of Unknown Organism: potential gut 
enriched Krueppel- like factor (GKLF) site 

<400> 14 

ccyytyyyty ntty 14 

<210> 15 
<211>22 
<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 15 

aagtcactcc agctttggca ag 22 

<210> 16 
<211>20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer 
<400> 16 

ttgcagaggc agggtttgtg 20 



<210> 17 
<211> 10 
<212>DNA 

<213> Unknown Organism 
<220> 

<221> unsure 
<222>(1) 

<223>N=A orGorCorT 
<220> 

<221> mutation 
<222> (7) 

<223> C is replaced by T 
<220> 

<221> unsure 
<222>(10) 

<223>N=A orGorCorT 
<220> 

<223> Description of Unknown Organism: potential 
CETS1P54 [c-Ets-l(p54)] site 



<400> 17 
nrcwtcckgn 



10 
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<210> 18 
<211>10 
<212>DNA 

<213> Unknown Organism 
<220> 

<221> unsure 
<222>(1)..(2) 

<223> N= A or G or C or T 
<220> 

<221> misc_feature 

<222> (6) 

<223> A is deleted 

<220> 

<221> unsure 
<222>(9)..(10) 
<223>N=A orGorCorT 

<220> 

<223> Description of Unknown Organism: potential SOX5 
binding site 

<400> 18 

nnaacaatnn 10 



<210> 19 
<211>16 
<212>DNA 

<213> Unknown Organism 



<220> 
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<221> misc_feature 

<222> (2) 

<223> A is deleted 

<220> 

<221> unsure 
<222>(ll) 

<223>N=A orGorCorT 
<220> 

<221> unsure 
<222>(13)..(16) 
<223>N=A orGorCorT 

<220> 

<223> Description of Unknown Organism: potential Thing 
1/E47 heterodimer (TH1E47) binding site 

<400> 19 

aawkccagay ncnnnn 1 6 

<210>20 
<211>16 
<212> DNA 

<213> Unknown Organism 
<220> 

<221> unsure 
<222>(11) 

<223>N=A orGorCorT 
<220> 

<221> unsure 



WO 00/49169 PCT/USOO/04251 

15 

<222>(13)..(16) 
<223>N=A orGorCorT 



<220> 

<221> misc_feature 
<222> (14) 
<223> N is deleted 



<220> 

<223> Description of Unknown Organism: Thing 1/E47 
heterodimer (TH1E47) binding site 

<400> 20 

aawkccagay ncnnnn 1 6 



<210>21 
<211> 19 
<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400>21 

ggagttgggt ttgggggag 19 

<210> 22 
<211>23 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: primer 
<400> 22 

tcttgctagg gcaaccagat ttg 23 

<210>23 
<211> 18 
<212> DNA 

<213> Unknown Organism 
<220> 

<221> unsure 
<222>(1)..(2) 

<223> N= A or G or C or T 
<220> 

<221> unsure 
<222> (8)..(13) 
<223>N=A orGorCorT 

<220> 

<221> unsure 
<222>(16)..(18) 
<223>N=A orGorCorT 

<220> 

<221> misc_feature 
<222>(16) 

<223> T is inserted after N 

<220> 

<223> Description of Unknown Organism: potential nuclear 
factor- 1 (NF-1) site 
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<400> 23 

nnttggcnnn nnnccnnn 18 



<210> 24 
<211> 12 
<212>DNA 

<213> Unknown Organism 
<220> 

<221> unsure 
<222>(1)..(5) 

<223> N= A or G or C or T 
<220> 

<221> mutation 
<222> (8) 

<223> A is replaced with G 
<220> 

<221> unsure 
<222>(12) 

<223>N=A orGorCorT 
<220> 

<223> Description of Unknown Organism: S8 binding site 
<400> 24 

nnnnnyaatt an 12 

<210>25 
<211>13 
<212> DNA 
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<213> Unknown Organism 
<220> 

<221> mutation 
<222> (4) 

<223> A is replaced with G 
<220> 

<221> unsure 
<222> (6)..(7) 
<223> N= A or G or C or T 

<220> 

<221> unsure 
<222> (9).. (13) 
<223>N=A orGorCorT 

<220> 

<223> Description of Unknown Organism: TCF1 1/KCR-Fl/Nrfl 
homodimer (TCF1 1) binding site 

<400> 25 

gtcatnnwnn nnn 1 3 



<210>26 
<211>25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 26 
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ggacatatct gaaagagaaa ggggg 25 



<210>27 
<211>22 
<212> DNA 

<213> Anificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 27 

ttgggagtca cctgaatgct tg 



22 
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